Gene P9211_11401 details


Gene Information

Locus tag: P9211_11401
Symbol: thrA
ID: 5731354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1043585
End bp: 1044901
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 43%
IMG OID: 641285508
Product: homoserine dehydrogenase
Protein accession: YP_001551025
Protein GI: 159903681
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
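
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the final codon is an untranslated stop. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1043585
end_bp = 1044901
gene_length_bp = 1317
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one codon per residue, plus one terminal stop codon.
codons = span // 3

print(span == gene_length_bp)            # coordinates agree with Gene Length
print(span % 3 == 0)                     # length is a whole number of codons
print(codons - 1 == protein_length_aa)   # codons minus stop agree with Protein Length
```

Under these assumptions all three checks print True for this record.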


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.814686
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAGA ATATTGGTAT AGGCCTCCTA GGACTTGGAA CAGTAGGGAC TGGAGTAGCA 
CAGATAATCA ATTCTCCGGA AGGGCGCCAC CCTTTGACAT CTAGAGTCGA GTTAAAGCGA
ATAGCTCTCA GAGATTCTAA GAAAATCAGA GCTCTGTCAA TACCTAACAA ATTAATCACA
GAAGATGCCT GGGAAGTTGT CGAGGATCCT GATGTAGAAA TTGTTGTTGA AGTTATAGGA
GGTCTTGAGC CCGCAAGAAG TCTGATTCTC AAGGCAATCA AGTCCGGCAA ATCTGTAGTT
ACTGCAAACA AAGCTGTTAT CGCAAGACAT GGTGAGGAAA TTGCAGAAGC CGCAATTTCT
TCAGGTGTTT ATGTACTCAT AGAAGCAGCA GTAGGTGGAG GGATCCCAAT TATCGAACCG
TTAAAACAAT CACTAGGAGG CAACATAATT CAAAAGGTCA CTGGAATTGT GAATGGAACT
ACGAACTACA TTCTGACCAG GATGGCAAAG GAAGGTGCTG ACTACGAAGC CGTATTAAAA
GAAGCTCAAT CTCTTGGCTA TGCAGAGTCT GATCCCATGG CAGATGTAGA GGGCCTCGAT
GCAGCAGATA AGATCTCGAT ACTCAGCAAT CTTGCTTTTG GTGGGCCAAT TAAAAGAGCA
TCTGTACCAA CCAAAGGGAT AAGCACTCTT CAAAATAGAG ATGTTGACTA TGCAAATCAG
TTGGGTTATG AAGTCAAGCT TTTAGCTATA GCTGAGAGAC TTGCTAGCAA TCTTGAAAAC
AACTCTTCAC TCCCATTAGC AGTAAGAGTT GAGCCAACAC TATTACCAAC CGGCCATCCA
CTTGCAGAAG TTAATGGAGT AAACAACGCA ATTCTTGTTG AAGGAGATCC AATCGGAGAA
GTAATGTTCT ATGGACCTGG AGCAGGAGCA GGACCTACCG CCTCAGCAGT AGTGGCAGAC
ATACTTAATA TTGCAGGGAT AAAACTTATG GGAGGGGAAA AGACGTCTCT AGACCCTCTA
CTCTCAGCAT CTAGCTGGAG AGAATGCCAT TTAGCAAAGC CAAAAGAAAT TTTACAAAAG
AACTATGTCC GTCTCATTGC TAAAGATGCT CCAGGGGTAA TTGGTCAAAT TGGGAAAATA
TTTGGATCTC ACAATGTCTC AATTCAATCA ATAGTCCAAT TCGATGCTAG TGAAGAGGAT
GCGGAAATCG TTGTAATTAC TCACAAGGTG TTCAAAGGTT TACTGACAGA TTCTCTTTCT
GAGATACAGC AACTCCCAGA GATCAAACAA ATTGCAGCCC ATCTAAGTTG TCTTTAA
 
Protein sequence
MTKNIGIGLL GLGTVGTGVA QIINSPEGRH PLTSRVELKR IALRDSKKIR ALSIPNKLIT 
EDAWEVVEDP DVEIVVEVIG GLEPARSLIL KAIKSGKSVV TANKAVIARH GEEIAEAAIS
SGVYVLIEAA VGGGIPIIEP LKQSLGGNII QKVTGIVNGT TNYILTRMAK EGADYEAVLK
EAQSLGYAES DPMADVEGLD AADKISILSN LAFGGPIKRA SVPTKGISTL QNRDVDYANQ
LGYEVKLLAI AERLASNLEN NSSLPLAVRV EPTLLPTGHP LAEVNGVNNA ILVEGDPIGE
VMFYGPGAGA GPTASAVVAD ILNIAGIKLM GGEKTSLDPL LSASSWRECH LAKPKEILQK
NYVRLIAKDA PGVIGQIGKI FGSHNVSIQS IVQFDASEED AEIVVITHKV FKGLLTDSLS
EIQQLPEIKQ IAAHLSCL
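
The relationship between the gene sequence and the protein sequence can be illustrated by translating short fragments of the gene. This is a minimal sketch, not the database's own method: the `translate` helper and the `CODON_TABLE` name are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons. Only the first 30 and last 12 nucleotides of the gene sequence are used, with the spacing removed.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position
# (standard code; table 11 uses the same assignments for elongation codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence (first three 10-nt groups of the record).
gene_start = "ATGACAAAGA ATATTGGTAT AGGCCTCCTA"
# Last 12 nt of the gene sequence, including the terminal TAA stop codon.
gene_end = "AGTTGTCTTTAA"

print(translate(gene_start))  # MTKNIGIGLL
print(translate(gene_end))    # SCL
```

The two outputs match the first ten residues and the last three residues of the protein sequence listed above.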