Gene Cyan8802_3088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3088 
Symbol 
ID8392418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp3120717 
End bp3122303 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content53% 
IMG OID644981034 
ProductTonB family protein 
Protein accessionYP_003138766 
Protein GI257060878 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3064] Membrane protein involved in colicin uptake 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0726849 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTTT CTAATTTGTG TATTGAACAG CGTAATCAAG AAAAAGAAAT CCTCAAAAAG 
TTCATCCTCT ATGGACTCGC AGGTTCAGTT GCGCTGCATG GGTTACTGAT ATTAAGTCTC
AAATGGCTAC CAACCAGCGA AACCATGGCA GAAGAGCCCA TTGAACTGAT CATGATTGAA
GAACCCCAAG CCGAAATAGA GCCCCCAAAA CCCGAACCCG AACCGAAATT AGAAACTAAA
TTAACCCCAG ACCCCTTACC CCAACTCGAA CAGCCAAACA TCGAGCAATT TCAAGCCAGT
TCTCAAATAT CGTCAAATCC CGTGGCTCAA CCCTTGCCAC CCATGGCTAT CCCCTCCCAA
CCCGCGCAAG ACACCTCAGC CCCTGAACCT GCCCCTGCCA TTGAACCCCC CGTTGAACCT
CTCCCTAACC CTGACCCTCA ACCGGCTGCG TCCCTTCCCG AACCGGTCAC CTCAGCCCCT
CCCACTCTTC CCAAAGAACC TGTTCAAGCC CAAACCACTC CTGAAACACC TACCCCAGAG
ACAACCGTCG CCGTCGCAGA ACCCACCATG AGTCGTTCTG TGCCCAACCG TCCAAGCCTT
CCCGCTAACC CCTTAACCGC AGCTACAGAA ACCATCAAAA ACCTAGGCGA TCGCCTACGG
GGGAATCCTA GCTCTAATGC CCCAGAAACG GGCAATCCGT CAGGAAATCC CAGTAATCCA
GGGGGAGTTG CTGCCAATCG TTCCCGACCC GGTGGCGGCG CAGCTAGAAC CCAAGCTCTC
AGTTCTAACC CTGGCGGCGG GTTAGGACAG CTAAGAAGTG GTCTACAAGG CGGCACGGGA
ACTGGAAGCA CTAGGGGGAC GGGAAAGGGA ACTGGCAGTA GTACGGGAAG CGGTTCAAAC
CCCGGAAACC CTGGTAATGG GTCTGCTGCT GCCAATCGGG CTGCGCCAGG TCGTCCTGGC
AATTCTCAGG AACTTAGTAC CTCCGGGGCA GGGTGTACAG CCCCGGCTAA GCCCAATTTT
CCCACCGCTT TAGCCAATAA AGGCATTGAA GCCCGACCCG TGGTAGAAGT GATCACCAAT
GCCAGTGGTA AGGTGATTAC TGCCAATATT CGGTCATCGA GTGGCTATCC CCAGTTAGAT
CAATTGGCCA TCAATACGGC TAAAAATGTT CGCTGTCCAT CAGGGAATAG AGGAAGAAAA
CTCCAACTAG CTATCACTTT TGCCCAACAG GGCAGCACCT TAGAACAAGA AGCCCGACAA
CGACAAGCAG AACTCGAACG GCAGCGACAG GAAAAAGAAC GACAAGAAGC TGCCCGACGA
CAAGCAGAAA TCGAACAACA GCGACAAGCA GAAGCGCAAC GACGCGCAGA AGCCGAACGA
CAACGACAAG AAGAAGCAAA ACGACAAGCA GAGGAGGAAA GACAGCGTAA AGAACAAGAA
AGACAAGCAG AAGCCGAACG ACAGCGACAA GCGGAGTTAG AACAGCAACG TCAGCAGGAA
TTACAGCCCA AACCAGAATT AACTCCTGAA CCTGAACCTC CCACAACGGA ATTACCGCCC
CTAGACCCGG TTCCTTCCGT TGAATAA
 
Protein sequence
MSLSNLCIEQ RNQEKEILKK FILYGLAGSV ALHGLLILSL KWLPTSETMA EEPIELIMIE 
EPQAEIEPPK PEPEPKLETK LTPDPLPQLE QPNIEQFQAS SQISSNPVAQ PLPPMAIPSQ
PAQDTSAPEP APAIEPPVEP LPNPDPQPAA SLPEPVTSAP PTLPKEPVQA QTTPETPTPE
TTVAVAEPTM SRSVPNRPSL PANPLTAATE TIKNLGDRLR GNPSSNAPET GNPSGNPSNP
GGVAANRSRP GGGAARTQAL SSNPGGGLGQ LRSGLQGGTG TGSTRGTGKG TGSSTGSGSN
PGNPGNGSAA ANRAAPGRPG NSQELSTSGA GCTAPAKPNF PTALANKGIE ARPVVEVITN
ASGKVITANI RSSSGYPQLD QLAINTAKNV RCPSGNRGRK LQLAITFAQQ GSTLEQEARQ
RQAELERQRQ EKERQEAARR QAEIEQQRQA EAQRRAEAER QRQEEAKRQA EEERQRKEQE
RQAEAERQRQ AELEQQRQQE LQPKPELTPE PEPPTTELPP LDPVPSVE