Gene A9601_19251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_19251 
SymbolthrC 
ID4718665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1668615 
End bp1669718 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content32% 
IMG OID640079660 
Productthreonine synthase 
Protein accessionYP_001010314 
Protein GI123969457 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.373874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGTTAT TAAATAAAAT AAAAAATAAA CTGCGTATTA ATTACAGAAA AAAAAGATGG 
CCAGGTCTAA TAGAAGCTTA TAAACAATAT CTTCCAGTTA CAAAGAAAAC TCCTATTATT
TCCCTAAATG AAGGAAATAC ACCACTAATC CTAAGCGAGT CAATTAGCAA CTTAATTGGA
AATAGAACAA AAGTTTTTTT AAAATATGAT GGCCTTAATC CAACTGGATC TTTTAAAGAT
CGTGGAATGA CTATGGCAAT TAGCAAAGCA AAAGAAGAAG GACGAGAAGC AGTAATTTGT
GCAAGTACTG GAAATACATC TGCTGCTGCT GCTGCATATG CTTCGAGAGG AGGATTAAAA
CCTTATGTTT TAATTCCAGA AGGATTTGTT GCACAAGGAA AGCTTGCGCA AGCATTAATG
TATGGTGCTG AGATAATATC TATTAACGGA AACTTTGATA AGGCTCTTGA AATTGTTAGA
GATTTATCCT CAGAACATCC TATAGAACTT GTTAATTCTG TTAATCCATA TCGAATACAA
GGACAAAAAA CAGCAGCTTT TGAAATAGTT GATGACTTAG GTTATGCTCC TGATTGGCTT
TGTATTCCTA TGGGTAATGC AGGAAACATA ACTGCTTATT GGATGGGATT TAAAGAATAT
TCAAAAATAA AAAGCAATTT GAAATTACCA ATAATGATGG GTTTTCAGTC CGAAGGCTCT
GCTCCATTAG TAAAAAATAT AATAGTTAAG GATCCAGAAA CAATTGCAAC TGCAATAAGA
ATTGGAAATC CTGTAAATAG AGAAAAAGCC AAAAAAGTAA GGAAGGAGAG TAAAGGAGAC
TTTCAATCAG TTACAGATGA AGAAATAATC AATGCTTATA AAATTCTTGC CAAAGAGGGA
GTATTTTGTG AACCTGCCAG TGCAGCATCA GTTGCTGGAC TAATTAAAAA TAAAAATAGA
ATTCAGAAAG AATCGACTAT TGTTTGTGTT CTGACTGGAA ATGGATTGAA AGATCCTGAT
TGCGCTATTA AAAATAACGA TGCTATTTTT AGGAAAAATA TTGAACCTTC ATTAAAAAAT
ATAACTAAAA TCTTAGGATA TTAA
 
Protein sequence
MVLLNKIKNK LRINYRKKRW PGLIEAYKQY LPVTKKTPII SLNEGNTPLI LSESISNLIG 
NRTKVFLKYD GLNPTGSFKD RGMTMAISKA KEEGREAVIC ASTGNTSAAA AAYASRGGLK
PYVLIPEGFV AQGKLAQALM YGAEIISING NFDKALEIVR DLSSEHPIEL VNSVNPYRIQ
GQKTAAFEIV DDLGYAPDWL CIPMGNAGNI TAYWMGFKEY SKIKSNLKLP IMMGFQSEGS
APLVKNIIVK DPETIATAIR IGNPVNREKA KKVRKESKGD FQSVTDEEII NAYKILAKEG
VFCEPASAAS VAGLIKNKNR IQKESTIVCV LTGNGLKDPD CAIKNNDAIF RKNIEPSLKN
ITKILGY