Gene PCC8801_3033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3033 
Symbol 
ID7105984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3176231 
End bp3177817 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content53% 
IMG OID643476060 
ProductTonB family protein 
Protein accessionYP_002373173 
Protein GI218247802 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3064] Membrane protein involved in colicin uptake 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTTT CTAATTTGTG TATTGAACAG CGTAATCAAG AAAAAGAAGT CCTCAAAAAG 
TTCATCCTCT ATGGACTCGC AGGTTCAGTT GCGCTGCATG GGTTACTGAT ATTAAGTCTC
AAATGGCTAC CAACCAGCGA AACCATGGCA GAAGAGCCCA TTGAACTGAT CATGATTGAA
GAACCCCAAG CCGAAATAGA GCCCCCAAAA CCCGAACCCG AACCGAAATT AGAAACTAAA
TTAACCCCAG ACCCCTTACC CCAACTCGAA CAGCCAAACA TCGAGCAATT TCAAGCCAGT
TCTCAAATAT CGTCAAATCC CGTGGCTCAA CCCTTGCCAC CCATGGCTAT CCCCTCCCAA
CCCGCGCAAG ACACCTCAGC CCCTGAACCT GCCCCTGCCA TTGAACCCCC CGTTGAACCT
CTCCCTAACC CTGACCCTCA ACCGGCTGCG TCCCTTCCCG AACCGGTCAC CTCAGCCCCT
CCCACTCTTC CCAAAGAACC TGTTCAAGCC CAAACCACTC CTGAAACACC TACCCCAGAG
ACAACCGTCG CCGTCGCAGA ACCCACCATG AGTCGTTCTG TGCCCAACCG TCCAAGCCTT
CCCGCTAACC CCTTAACCGC AGCTACAGAA ACCATCAAAA ACCTAGGCGA TCCCCTACGG
GGGAATCCTA GCTCTAATGC CCCAGAAACG GGCAATCCGT CAGGAAATCC CAGTAATCCA
GGGGGAGTTG CTGCCAATCG TTCCCGACCC GGTGGCGGCG CAGCTAGAAC CCAAGCTCTC
AGTTCTAACC CTGGCGGCGG GTTAGGACAG CTAAGAAGTG GTCTACAAGG CGGCACGGGA
ACTGGAAGCA CTAGGGGGAC GGGAAAGGGA ACTGGCAGTA GTACGGGAAG CGGTTCAAAC
CCCGGAAACC CTGGTAATGG GTCTGCTGCT GCCAATCGGG CTGCTCCAGG TCGTCCTGGC
AATTCTCAGG AACTTAGTAC CTCCGGGGCA GGGTGTACAG CCCCGGCTAA GCCCAATTTT
CCCACCGCTT TAGCCAATAA AGGCATTGAA GCCCGACCCG TGGTAGAAGT GATCACCAAT
GCCAGTGGTA AGGTGATTAC TGCCAATATT CGGTCATCGA GTGGCTATCC CCAGTTAGAT
CAATTGGCCA TCAATACGGC TAAAAATGTT CGCTGTCCAT CAGGGAATAG AGGAAGAAAA
CTCCAACTAG CTATCACTTT TGCCCAACAG GGCAGCACCT TAGAACAAGA AGCCCGACAA
CGACAAGCAG AACTCGAACG GCAGCGACAG GAAAAAGAAC GACAAGAAGC TGCCCGACGA
CAAGCAGAAA TCGAACAACA GCGACAAGCA GAAGCGCAAC GACGCGCAGA AGCCGAACGA
CAACGACAAG AAGAAGCAAA ACGACAAGCA GAGGAGGAAA GACAGCGTAA AGAACAAGAA
AGACAAGCAG AAGCCGAACG ACAGCGACAA GCGGAGTTAG AACAGCAACG TCAGCAGGAA
TTACAGCCCA AACCAGAATT AACTCCTGAA CCTGAACCTC CCACAACGGA ATTACCGCCC
CTAGACCCCG TTCCTTCCGT TGAATAA
 
Protein sequence
MSLSNLCIEQ RNQEKEVLKK FILYGLAGSV ALHGLLILSL KWLPTSETMA EEPIELIMIE 
EPQAEIEPPK PEPEPKLETK LTPDPLPQLE QPNIEQFQAS SQISSNPVAQ PLPPMAIPSQ
PAQDTSAPEP APAIEPPVEP LPNPDPQPAA SLPEPVTSAP PTLPKEPVQA QTTPETPTPE
TTVAVAEPTM SRSVPNRPSL PANPLTAATE TIKNLGDPLR GNPSSNAPET GNPSGNPSNP
GGVAANRSRP GGGAARTQAL SSNPGGGLGQ LRSGLQGGTG TGSTRGTGKG TGSSTGSGSN
PGNPGNGSAA ANRAAPGRPG NSQELSTSGA GCTAPAKPNF PTALANKGIE ARPVVEVITN
ASGKVITANI RSSSGYPQLD QLAINTAKNV RCPSGNRGRK LQLAITFAQQ GSTLEQEARQ
RQAELERQRQ EKERQEAARR QAEIEQQRQA EAQRRAEAER QRQEEAKRQA EEERQRKEQE
RQAEAERQRQ AELEQQRQQE LQPKPELTPE PEPPTTELPP LDPVPSVE