Gene Information |
Locus tag | PCC8801_3033 |
Symbol | |
ID | 7105984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 3176231 |
End bp | 3177817 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643476060 |
Product | TonB family protein |
Protein accession | YP_002373173 |
Protein GI | 218247802 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain |
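The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the Start bp and End bp values are 1-based and inclusive and that the Gene Length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3176231, 3177817)
print(length_bp, protein_length(length_bp))  # 1587 528
```

Under these assumptions the computed values agree with the Gene Length (1587 bp) and Protein Length (528 aa) fields of this record.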
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTTT CTAATTTGTG TATTGAACAG CGTAATCAAG AAAAAGAAGT CCTCAAAAAG TTCATCCTCT ATGGACTCGC AGGTTCAGTT GCGCTGCATG GGTTACTGAT ATTAAGTCTC AAATGGCTAC CAACCAGCGA AACCATGGCA GAAGAGCCCA TTGAACTGAT CATGATTGAA GAACCCCAAG CCGAAATAGA GCCCCCAAAA CCCGAACCCG AACCGAAATT AGAAACTAAA TTAACCCCAG ACCCCTTACC CCAACTCGAA CAGCCAAACA TCGAGCAATT TCAAGCCAGT TCTCAAATAT CGTCAAATCC CGTGGCTCAA CCCTTGCCAC CCATGGCTAT CCCCTCCCAA CCCGCGCAAG ACACCTCAGC CCCTGAACCT GCCCCTGCCA TTGAACCCCC CGTTGAACCT CTCCCTAACC CTGACCCTCA ACCGGCTGCG TCCCTTCCCG AACCGGTCAC CTCAGCCCCT CCCACTCTTC CCAAAGAACC TGTTCAAGCC CAAACCACTC CTGAAACACC TACCCCAGAG ACAACCGTCG CCGTCGCAGA ACCCACCATG AGTCGTTCTG TGCCCAACCG TCCAAGCCTT CCCGCTAACC CCTTAACCGC AGCTACAGAA ACCATCAAAA ACCTAGGCGA TCCCCTACGG GGGAATCCTA GCTCTAATGC CCCAGAAACG GGCAATCCGT CAGGAAATCC CAGTAATCCA GGGGGAGTTG CTGCCAATCG TTCCCGACCC GGTGGCGGCG CAGCTAGAAC CCAAGCTCTC AGTTCTAACC CTGGCGGCGG GTTAGGACAG CTAAGAAGTG GTCTACAAGG CGGCACGGGA ACTGGAAGCA CTAGGGGGAC GGGAAAGGGA ACTGGCAGTA GTACGGGAAG CGGTTCAAAC CCCGGAAACC CTGGTAATGG GTCTGCTGCT GCCAATCGGG CTGCTCCAGG TCGTCCTGGC AATTCTCAGG AACTTAGTAC CTCCGGGGCA GGGTGTACAG CCCCGGCTAA GCCCAATTTT CCCACCGCTT TAGCCAATAA AGGCATTGAA GCCCGACCCG TGGTAGAAGT GATCACCAAT GCCAGTGGTA AGGTGATTAC TGCCAATATT CGGTCATCGA GTGGCTATCC CCAGTTAGAT CAATTGGCCA TCAATACGGC TAAAAATGTT CGCTGTCCAT CAGGGAATAG AGGAAGAAAA CTCCAACTAG CTATCACTTT TGCCCAACAG GGCAGCACCT TAGAACAAGA AGCCCGACAA CGACAAGCAG AACTCGAACG GCAGCGACAG GAAAAAGAAC GACAAGAAGC TGCCCGACGA CAAGCAGAAA TCGAACAACA GCGACAAGCA GAAGCGCAAC GACGCGCAGA AGCCGAACGA CAACGACAAG AAGAAGCAAA ACGACAAGCA GAGGAGGAAA GACAGCGTAA AGAACAAGAA AGACAAGCAG AAGCCGAACG ACAGCGACAA GCGGAGTTAG AACAGCAACG TCAGCAGGAA TTACAGCCCA AACCAGAATT AACTCCTGAA CCTGAACCTC CCACAACGGA ATTACCGCCC CTAGACCCCG TTCCTTCCGT TGAATAA
|
Protein sequence | MSLSNLCIEQ RNQEKEVLKK FILYGLAGSV ALHGLLILSL KWLPTSETMA EEPIELIMIE EPQAEIEPPK PEPEPKLETK LTPDPLPQLE QPNIEQFQAS SQISSNPVAQ PLPPMAIPSQ PAQDTSAPEP APAIEPPVEP LPNPDPQPAA SLPEPVTSAP PTLPKEPVQA QTTPETPTPE TTVAVAEPTM SRSVPNRPSL PANPLTAATE TIKNLGDPLR GNPSSNAPET GNPSGNPSNP GGVAANRSRP GGGAARTQAL SSNPGGGLGQ LRSGLQGGTG TGSTRGTGKG TGSSTGSGSN PGNPGNGSAA ANRAAPGRPG NSQELSTSGA GCTAPAKPNF PTALANKGIE ARPVVEVITN ASGKVITANI RSSSGYPQLD QLAINTAKNV RCPSGNRGRK LQLAITFAQQ GSTLEQEARQ RQAELERQRQ EKERQEAARR QAEIEQQRQA EAQRRAEAER QRQEEAKRQA EEERQRKEQE RQAEAERQRQ AELEQQRQQE LQPKPELTPE PEPPTTELPP LDPVPSVE
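The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate a gene sequence with the amino acid assignments of translation table 11. It uses only the Python standard library; the names `clean`, `gc_content`, and `translate` are hypothetical helpers written for illustration, and the handling of alternative start codons in table 11 is deliberately omitted because this gene begins with ATG.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional TCAG codon order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Simplification: alternative start codons are not converted to M.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence field.
print(translate("ATGAGTCTTT CTAATTTGTG TATTGAACAG"))  # MSLSNLCIEQ
```

The first three blocks of the Gene sequence translate to the first block of the Protein sequence. If the record is internally consistent, passing the full Gene sequence to `translate` would be expected to reproduce the full Protein sequence, and `gc_content` would be expected to come out close to the reported GC content.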