Gene P9211_18541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_18541 
SymbolthrC 
ID5730006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1687695 
End bp1688801 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content33% 
IMG OID641286241 
Productthreonine synthase 
Protein accessionYP_001551739 
Protein GI159904395 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.210401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0322921 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCATTT TAAAAAGACT CCCAAGGATT TTCGACAGAA GCCATGAATA CAAACATTGG 
CCAGGATTAA TCAAAGCATA TAAAAAATGG CTTCCAGTCA CTGATAAAAC TCCTATAATC
ACGCTTCAAG AGGGAGCAAC TCCATTAATA CCTTTGAAAT CAATTAATGA ATTAATTGGT
AAAAATGTCA AGATTTTTGT CAAGTATGAC GGTTTAAACC CAACTGGATC TTTTAAAGAT
AGAGGGATGA CAATGGCAAT AAGTAAAGCT AAAGAAGATA ATTGTGAAGC AGTGATTTGC
GCTAGCACTG GAAACACATC TGCTTCTGCA GCAGCTTATG CAAAAAGAGG AGGAATGAAA
AGTTTTGTTT TGATCCCAGA TGGGTATGTA GCACAAGGAA AGCTTGCTCA AGCACTTGTT
TATGGTGCTG AAGTTTTAGC AATTAAAGGA AACTTTGATA AAGCATTAAA TATTGTCCAA
GAGTTATCAA ATAGATACCC AATAACTTTA GTAAATTCCG TTAATCCTTA TCGTATTCAA
GGGCAAAAAA CTGCAGCTTT TGAAATTATA GACTCAATTG GTGAAGCACC AGACTGGTTA
TGTATTCCAA TGGGAAATGC AGGAAATATT ACAGCATATT GGATGGGGTT TGAAGAATAT
TATAAAGCTG GGAAAAGTAA AAAGTTACCT AAAATGATGG GTTTTCAAGC TATTGGCTCT
GCACCTTTAG TTTTTAATAA AACAATTGAA AATCCAGAAA CAATAGCAAC AGCAATAAGA
ATCGGAAACC CTGTTAATAA AGAAAAAGCA TTTATTGTAA AAGGTAAAAG TAAAGGTAAA
TTTACTGCTG TTACTGATAA AGAAATTATA GATGCTTATA AACTTTTAGG TAAAGAAGAA
GGTATTTTTT GTGAACCAGC AAGTGCAGCA TCTATAGCTG GATTAATAAA ATTGAAAGAA
GAAATTCCTC CAAATTCTAC AATTGTTTGT GTTTTAACAG GTAATGGTTT AAAAGATCCT
GATTGTGCAA TTAATAATAA TGATGCATCT TTTAAAAATA ATTTAGAACC TAATTCTGAT
GAAATAGCAA AAGCCATAGG ATTCTAA
 
Protein sequence
MSILKRLPRI FDRSHEYKHW PGLIKAYKKW LPVTDKTPII TLQEGATPLI PLKSINELIG 
KNVKIFVKYD GLNPTGSFKD RGMTMAISKA KEDNCEAVIC ASTGNTSASA AAYAKRGGMK
SFVLIPDGYV AQGKLAQALV YGAEVLAIKG NFDKALNIVQ ELSNRYPITL VNSVNPYRIQ
GQKTAAFEII DSIGEAPDWL CIPMGNAGNI TAYWMGFEEY YKAGKSKKLP KMMGFQAIGS
APLVFNKTIE NPETIATAIR IGNPVNKEKA FIVKGKSKGK FTAVTDKEII DAYKLLGKEE
GIFCEPASAA SIAGLIKLKE EIPPNSTIVC VLTGNGLKDP DCAINNNDAS FKNNLEPNSD
EIAKAIGF