Gene HMPREF0424_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1231 
Symbol 
ID8708802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1463516 
End bp1464886 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content48% 
IMG OID646483319 
Producthomoserine dehydrogenase 
Protein accessionYP_003374424 
Protein GI283783670 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000217271 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAAGCA ATCAGTCGAC TAAAACTATT CGCGTAGGCC TACTTGGGGC TGGAACTGTT 
GGGTCTCAAA CAGCGCGACT AATAGTTGAA CAGTTCAATG AGCTTAAGAA GCGAACTGGC
GCTGAAATTG AGCTTGCAGC AGTTGCATGC TTACGCCCGG AAGAAGTTGA CGCTCCTTGG
ATTAAGCGTG ATTTGCTAAC TACAGACACT GCCTCTTTGT GCGCTCGCGA AGACATTGAT
ATTATAGTCG AGCTTATTGG CGGACTTGAG CCGGCTCATA CTTTTGTAAA AAGTGCGCTA
TGCCACGGAA AATCTGTTGT TACTGCAAAT AAAGCGTTGC TTGCGAAGTT TGGTCCGGAA
TTATACGAGT GTGCAGAAAG TCACGGAGTT GACTTGTACT TTGAGGCGGC TGTAGCGGGA
GCGATTCCTA TTGTTCGACC GCTTCGAGAG TCGCTTATTG GTGACAAAAT TACGCAAATT
TTCGGAATTG TAAACGGAAC TACCAACTAT ATTCTTGACG AAATGACTGT GCGCGGACTT
GATTTTGATT TGGTTTTGCA CGCGGCTCAA GAAAAGGGTT ATGCTGAGGC AGATCCGACC
GGAGATGTTG AAGGTTTCGA TGCTGCAAAT AAAGCTGCGA TTCTTGCAAC TCTTGCATTC
CAAATGCCAG TAAGTATTGA CGATGTATCC GTTGAAGGAA TTAGCGCGAT TACTGCTGAA
GATATTGCTG CAGCGAGCGC GGAAAAGCGT GTAATTAAGC TACTTGCAGC AGTCGAAAGG
CATTGCGATG GTAAGTCAAA ATCGGATGGC GGAGTAAGTG TTAATGTGTA TCCGGCGCTT
GTTGGTGCGG AGCACCCTTT GGCGTCGGTT CACGGCAGTT TTAATGCTGT GTTTGTGAAG
GCGCAGGCTG CGGACGATTT GATGTTCTAC GGACGCGGTG CCGGCGGTGC TCCAACTGCA
AGTGCTGTTG TTGGAGATGT TGTTAGTGCT GCTCGAAATC TTGTGCGTGG ATGCGCAGGT
TTTGGCGTGC CAATGTATAA CAAGTATGTG CCGGCTTCTA GCGAGCAGAC TAGAGCGGAT
TTTGTGATTC GTTGCAATAT GGAAGATACT TCTTTGGCTT TGTGCAGCGA AGTTATGGAT
GTTTTTGTAG ATTATGGCGT TGCAGCTGAA CGTTTGGCTT CGGCGTGCCA AGCTAAGTAT
GCGCAAACTG AAGCTGATTC GGATTGCCCA CAGTGCGGTC TTGGTGGACC TGGCAGTGTG
CGCGTGCTTG TGCGCGAATG TTCGGAAGCT GCAGTTCAAG CTATTTGTGA GGATTTGCAG
AAGTTGGATG TTGTGTGTGG AAAGCCGCTA GTTTTGCGCG TTATAAAGTA G
 
Protein sequence
MQSNQSTKTI RVGLLGAGTV GSQTARLIVE QFNELKKRTG AEIELAAVAC LRPEEVDAPW 
IKRDLLTTDT ASLCAREDID IIVELIGGLE PAHTFVKSAL CHGKSVVTAN KALLAKFGPE
LYECAESHGV DLYFEAAVAG AIPIVRPLRE SLIGDKITQI FGIVNGTTNY ILDEMTVRGL
DFDLVLHAAQ EKGYAEADPT GDVEGFDAAN KAAILATLAF QMPVSIDDVS VEGISAITAE
DIAAASAEKR VIKLLAAVER HCDGKSKSDG GVSVNVYPAL VGAEHPLASV HGSFNAVFVK
AQAADDLMFY GRGAGGAPTA SAVVGDVVSA ARNLVRGCAG FGVPMYNKYV PASSEQTRAD
FVIRCNMEDT SLALCSEVMD VFVDYGVAAE RLASACQAKY AQTEADSDCP QCGLGGPGSV
RVLVRECSEA AVQAICEDLQ KLDVVCGKPL VLRVIK