Gene Cyan8802_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_1054 
Symbol 
ID8390363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp1079488 
End bp1080621 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content48% 
IMG OID644979068 
Productthreonine synthase 
Protein accessionYP_003136821 
Protein GI257058933 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.470109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00082788 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAGTTG ATCATCTAAC GCAACAATCG CAACCTAACC CCGTAGAATC CCCTAAGCTC 
CCCCCTTCAG ACCTAAAACC CTGGCGAGGG TTAATTGAAA CCTATCGCCC CTATCTTCCT
GTTACCGACA CAACTCCTGT CATTACCCTA CTTGAAGGGA ATACCCCTCT AATTCCCGTT
CCCTATATTT CCCAACAAAT CGGACGGGGA GTCAAAGTTT TGGTTAAATA CGATGGATTA
AACCCCACGG GAAGCTTTAA AGATCGCGGC ATGACTATGG CTATCTCCAA AGCCGTCGAA
AACGGGGCTA AAGCGGTTAT TTGTGCCAGT ACGGGGAATA CCTCAGCAGC AGCAGCAGCC
TACGCGAGAC GGGGCAAAAT GCGGGCATTC GTCATCATTC CTGATGGCTA TGTTGCCCTC
GGAAAACTAG CCCAAGCTTT ACTTTATGGG GCAGAAGTGA TCGCCATTGA TGGCAATTTT
GATGATGCCT TTAAGATTGT TCGAGGGATG GCCGAAAATT ACCCCGTAAC CTTGGTTAAT
TCCGTTAATC CCTATCGCTT AGAGGGGCAA AAAACGGCAG CCTTTGAAGT TGTTGATGTC
TTAGGCAATG CCCCCGACTG GTTGTGTATT CCCGTGGGGA ATGCCGGGAA TATTAGTGCC
TATTGGATGG GGTTTTGTCA ATATCATGGG TTAGGAAAGT GCGATCGCTT GCCAAAAATG
ATGGGCTTTC AAGCAGCCGG GGCTGCACCG TTTCTAACGG GTCAACCTGT ACCCCATCCT
GAAACCTTAG CAACTGCCAT TCGTATTGGC AACCCGGCTA ATTGGAACAA AGCTTGGGAA
ACCCAAAAAG CCAGTCACGG GGCGTTTAAT GGTGTCACCG ATGAGGAAAT TTTAGCAGCC
TATCGTATGT TGGCATCCCA AGAGGGGATT TTCTGTGAGC CAGCCAGTGC TGCTTCTGTG
GCAGGATTAT TAAAGGTTAA GGATCAAGTC CCCAGTGAAG CAACGGTCGT CTGTGTCCTG
ACGGGTAATG GACTTAAAGA TCCTGATTGT GCCATTAAAC ACAGCGATAA TCAACTTAAA
TCAGGGATTA AGGCTGATTT AGCTACAGTT GCTCAAGTGA TGGGGTTTGC GTAG
 
Protein sequence
MTVDHLTQQS QPNPVESPKL PPSDLKPWRG LIETYRPYLP VTDTTPVITL LEGNTPLIPV 
PYISQQIGRG VKVLVKYDGL NPTGSFKDRG MTMAISKAVE NGAKAVICAS TGNTSAAAAA
YARRGKMRAF VIIPDGYVAL GKLAQALLYG AEVIAIDGNF DDAFKIVRGM AENYPVTLVN
SVNPYRLEGQ KTAAFEVVDV LGNAPDWLCI PVGNAGNISA YWMGFCQYHG LGKCDRLPKM
MGFQAAGAAP FLTGQPVPHP ETLATAIRIG NPANWNKAWE TQKASHGAFN GVTDEEILAA
YRMLASQEGI FCEPASAASV AGLLKVKDQV PSEATVVCVL TGNGLKDPDC AIKHSDNQLK
SGIKADLATV AQVMGFA