Gene P9301_11571 details

Gene Information

Locus tag: P9301_11571
Symbol: thrA
ID: 4912040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 970319
End bp: 971620
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 32%
IMG OID: 640160743
Product: homoserine dehydrogenase
Protein accession: YP_001091381
Protein GI: 126696495
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
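
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, but both are consistent with the reported values. The function names are illustrative only.

```python
# Values copied from the record; the coordinate convention is an assumption.
START_BP = 970319
END_BP = 971620


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residues encoded by a CDS of n_bp bases, not counting the stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.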


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.292612
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAT GCAAAATTGG GATTGTAGGT TTTGGAACTG TAGGTTCAGG GATTTATAAA 
ATATTAAGTT CTGAAGTTGA TTCACATCCA ATTCTAAAAG AAATAGAAAT TGCAAAAATA
GCAGTTAAAG ATCTTAATAA AAAAAGGGAT ATTGAGCTAG ATAATAATTT ATTAACTGAT
GATCCATTTA AATTAATTAA TGACCCCTCT ATTGATGTAA TTGTTGAAGT AATGGGTGGG
GTTGATTTAG CTAAAGATAT TATTCTGCAA TCTTTAAAAT TAGGTAAATC TGTTGTTACA
GCAAATAAAG CAGTTATCGC AAGATATGGA GAAGAAATAT ATAAAACTGC ATCTAAAGAA
AGAGTTTATA TATTGTCAGA GGCAGCAGTT TGCGGAGGGA TTCCTATCAT TGAACCCTTA
AAAAGATCAT TAAAAAGTAA CTGTATAAAA AGAATGGTTG GGATAATAAA TGGCACAACA
AATTTTATTC TTTCAAAGAT GACAAATGAA AAAGCTGATT ACAAGGAGAC CTTAAAATTG
GCTCAAAGCC TTGGATATGC AGAATTTGAT CCAACTGCAG ATGTTGAGGG CCATGATGCT
GCTGATAAGA TTTCAATTCT TAGTGAACTT GCATTCGGAG GGAAAATCAA AAGAGAGGAG
ATTCATTTTG AGGGCATTAG TAAAATTAAT CTAAAGGATA TTGAATATGC CAATAAATTA
GGGTTTGAAA TAAAACTTTT AGCGCTATCC GAAAGGGGAC AAATTAATAG TAATGATTCA
CTCGCTTTAA ATATTTGGGT AGGACCTTCT TTGATTCCAA AATCTCATCC ATTGTCAACA
GTTAAGGGAG TTAATAATGC CTTATTGATT GAAGCTGATC CTCTTGGTGA AATAATGTTA
TATGGTCCAG GTGCAGGGAG TGGCCCAACT GCAGCGTCAG TGGTATCAGA TATATTAAAT
CTGCATGCCG CCAAAGAAAA AAATAATAAT TCAGTCGATC CATTATTATC TTTTGATTTC
TGGAGAAACT GCCATATAAC AAGCTCCTCA CAAATAAATA AAAAAAATTA CCTTAGAATT
ATTTGTCTTG ATAGTCCAGG TGTCATAGGA AAGATTGGAG ATATTTTTGG AAAGAATAAT
GTATCAATTG AATCAATTGT TCAACTTGAT GCTAGTGAGG ACAAAGCTGA AATTGTCGTT
ATTACTCATG AGGTGAATAA TGGAGATTTT GAGAGATCGA AAGATGAAAT AAATTCGCTA
AATGAAGTCA AAATTATTGC AAGTCAATTA AGTTGTATTT AA
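
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) clean a blocked sequence and compute its GC fraction. Only the first line of the listing is used as sample input; the full sequence can be pasted in the same way to compare against the reported GC content.

```python
def clean_sequence(text):
    """Strip spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGAGAAAAT GCAAAATTGG GATTGTAGGT TTTGGAACTG TAGGTTCAGG GATTTATAAA"

print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line), 3))
```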
 
Protein sequence
MRKCKIGIVG FGTVGSGIYK ILSSEVDSHP ILKEIEIAKI AVKDLNKKRD IELDNNLLTD 
DPFKLINDPS IDVIVEVMGG VDLAKDIILQ SLKLGKSVVT ANKAVIARYG EEIYKTASKE
RVYILSEAAV CGGIPIIEPL KRSLKSNCIK RMVGIINGTT NFILSKMTNE KADYKETLKL
AQSLGYAEFD PTADVEGHDA ADKISILSEL AFGGKIKREE IHFEGISKIN LKDIEYANKL
GFEIKLLALS ERGQINSNDS LALNIWVGPS LIPKSHPLST VKGVNNALLI EADPLGEIML
YGPGAGSGPT AASVVSDILN LHAAKEKNNN SVDPLLSFDF WRNCHITSSS QINKKNYLRI
ICLDSPGVIG KIGDIFGKNN VSIESIVQLD ASEDKAEIVV ITHEVNNGDF ERSKDEINSL
NEVKIIASQL SCI