Gene NATL1_22001 details

Gene Information

Locus tag: NATL1_22001
Symbol: thrC
ID: 4779345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1863437
End bp: 1864543
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 36%
IMG OID: 640085498
Product: threonine synthase
Protein accession: YP_001016020
Protein GI: 124026905
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
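
The coordinate and length fields above are related by simple arithmetic, and the following minimal sketch checks them against each other. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1863437
END_BP = 1864543
GENE_LENGTH_BP = 1107
PROTEIN_LENGTH_AA = 368


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1107
print(expected_protein_length(GENE_LENGTH_BP))  # 368
```

Both results agree with the Gene Length and Protein Length fields.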


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.821468
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGTCATTTT CTAAGAAATT AAAACAACTT TTCACTTCTA CCCCACCCAA AAACAATTGG 
GATGGACTTA TAGAAACATA CAAAACCTGG TTACCAGTAA GTAATAAGAC ACCCATCATT
ACTTTAAAAG AAGGCGCAAC TCCTCTTATT GAAGTCAATT CCATTTCTAA TCGAATTGGA
AATGGAGTAA AAGTATTTGT CAAATATGAC GGTCTAAATC CGACTGGATC CTTCAAAGAC
AGAGGAATGA CTATGGCGAT AAGTAAAGCT AAAGAAGCTG GGTGTGAAGC TGTAATTTGC
GCAAGCACAG GTAATACTTC AGCCTCTGCA GCAGCTTATG CAAGCAAAGG AGGAATGAAA
TCCTTTGTTA TTATTCCAGA TGGATATGTA GCTCAAGGTA AACTAGCTCA AGCTTTAGTT
TACGGAGCTG AAGTATTAGC TATTAAAGGG AATTTTGATA AAGCACTAGA TATTGTTAGA
GAACTATCTA ATAAATATCC AATTACTTTA GTGAACTCGG TAAATCCTTA CAGATTACAA
GGACAAAAAA CCGCAGCATT TGAGATAATA GAAAATCTTG GAAATGCACC TGATTGGTTA
TGCATTCCAA TGGGAAATGC TGGAAATATA AGTGCTTACT GGATGGGATT TCAAGAATTT
TTTCACGCTG GAAAATCAAA ACATCTCCCA AGAATGATGG GTTTTCAAGC GAGTGGTTCT
GCGCCTTTAG TCGAAGGTAA AAGTATCAGT AACCCTGAGA CAATTGCTAC AGCAATAAGA
ATTGGTAATC CCGTTAATAG AGAAAAAGCT CTTTTAGCAA AAGAATCAAG CAATGGTAGA
TTTTCTTCTG TATCAGATTC AGAAATAATC AACGCATATA AAATTCTTGG TAGAGAAGAA
GGCATTTTTT GTGAGCCTGC AAGTGCAGCT TCTGTGGCTG GTTTATTAAA AATCAAAGAA
GATGTCCCAA AAAATTCAAC TATTGTTTGT GTTTTGACAG GGAATGGTCT AAAAGATCCT
GATTGCGCAA TAAAAAATAA TGACTCAATG TTTCATACGG GAGTTGAACC AGAAATAAGT
TCAATAACAA AAATAATGGG CTTTTAA
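
The gene sequence is laid out in space-separated blocks of 10 bases, so the spacing has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and only the first line of the sequence is used here as a short illustration. Running it on the full sequence would give a figure to compare with the GC content reported for this gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay the sequence out in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence in this record.
first_line = "GTGTCATTTT CTAAGAAATT AAAACAACTT TTCACTTCTA CCCCACCCAA AAACAATTGG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```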
 
Protein sequence
MSFSKKLKQL FTSTPPKNNW DGLIETYKTW LPVSNKTPII TLKEGATPLI EVNSISNRIG 
NGVKVFVKYD GLNPTGSFKD RGMTMAISKA KEAGCEAVIC ASTGNTSASA AAYASKGGMK
SFVIIPDGYV AQGKLAQALV YGAEVLAIKG NFDKALDIVR ELSNKYPITL VNSVNPYRLQ
GQKTAAFEII ENLGNAPDWL CIPMGNAGNI SAYWMGFQEF FHAGKSKHLP RMMGFQASGS
APLVEGKSIS NPETIATAIR IGNPVNREKA LLAKESSNGR FSSVSDSEII NAYKILGREE
GIFCEPASAA SVAGLLKIKE DVPKNSTIVC VLTGNGLKDP DCAIKNNDSM FHTGVEPEIS
SITKIMGF
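
The protein sequence can be derived from the gene sequence with translation table 11, in which GTG (the first codon of this gene) is one of the permitted initiation codons and is read as methionine at the start position. The following is a minimal sketch in plain Python, not the database's own pipeline. The function name is illustrative, and it assumes the input is a complete forward-strand CDS.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(cds: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any valid initiation codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGTCATTTT CTAAGAAATT AAAACAACTT"))  # MSFSKKLKQL
```

The output matches the first 10 residues of the protein sequence above.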