Gene HMPREF0424_1230 details

Gene Information

Locus tag: HMPREF0424_1230
Symbol: thrB
ID: 8709499
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 1462429
End bp: 1463496
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 46%
IMG OID: 646483318
Product: homoserine kinase
Protein accession: YP_003374423
Protein GI: 283783669
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0083] Homoserine kinase
TIGRFAM ID: [TIGR00191] homoserine kinase
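
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1462429
end_bp = 1463496
gene_length_bp = 1068
protein_length_aa = 355


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1068
print(expected_protein_length(gene_length_bp))  # 355
```

Both results agree with the Gene Length and Protein Length fields, which suggests the record is internally consistent under these assumptions.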


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000185797
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGCCAG TGTGTAACAG TGTTTCCGTG CGCGTGCCAG CGACTAGCGC AAATCTTGGT 
TCTGGTTTTG ATACATCCGG TATTGCGCTT GATTATGCGG ATTCTTTAGT TTTTACTCTG
GATGATAATC ATTCTAAAAA TTCGCTTGAT TATGCTGGAA ACGTGCGCGT AATTATTCAC
GGCGAAGGCG AGGATACGCT TCCAAAAGAT GAAACGCATC TAGTTGTAGC GACTTTTCGC
AAAGCTTGCA AAATATTTGG TCTGCCAAAC TTGCGATTTA CGTTAGAAGC GCATAATCGG
ATTCCGCAAG CGCGCGGTAT GGGCTCTTCT GCAGAAGCGA TTGTTGCTGC AGTTGCTGCT
GCTTGGGCTT TTGCGCACGA AGGTGAGCTT AATCGCGAAG CAATTTTCGA AATCGCGGCC
GCAATTGAAG GTCATCCAGA CAATGTTGCT CCTGCAGTTT TTGGCGGTTT AACGATGAGC
TGGAAGCTTG ATGCTGGGGA AGGCAAAGGT TCTGTGCGCG TGGCTGGAGA TTGTGAAACT
CCGCTTTCTA CCGGTTTTCA TACTGTGCGA TACAAAGTTT CGCAAGATAT TTCCGTGTCT
GTATTTGTTC CTGATTTCGA GCTTTCTACA GAGCTTGCTA GGCGCGCGCT TCCAGCAAAA
GTGCCGTATG GTGACGCAAT ATTTAACGTG TCGCGAGTTG CAATGCTACC GGTTGCTTTT
GGTGAGTTAA ATAGCGATTC TGAAAATGAA TTAGGCACAA TAAAATCTGA TATTTCGAGA
AATGCGTTAC TTTTTGCAGC AACTCAAGAT GCTCTTCATC AACCGTATCG TGCAAATCTT
ATGAAAGATT CATGGAAGCT AGTGGAAACG TTGCGCGAGC ATGGTTTTGC TGCAGCGATT
TCGGGAGCAG GATCGTGCGT TGCTGTATTT TATGCAGGAG ACGCTGAGTG CAATAAGAGT
GCGGTTCAAA AAATCGACGC GATTGCTGAG CCGTGGCTTT CGCGTGCTGG CTGGCGTGTT
TTGCATGTGC AGGTTGATTC TATTGGCGTT GCAATAACTC GCGAATAA
 
Protein sequence
MKPVCNSVSV RVPATSANLG SGFDTSGIAL DYADSLVFTL DDNHSKNSLD YAGNVRVIIH 
GEGEDTLPKD ETHLVVATFR KACKIFGLPN LRFTLEAHNR IPQARGMGSS AEAIVAAVAA
AWAFAHEGEL NREAIFEIAA AIEGHPDNVA PAVFGGLTMS WKLDAGEGKG SVRVAGDCET
PLSTGFHTVR YKVSQDISVS VFVPDFELST ELARRALPAK VPYGDAIFNV SRVAMLPVAF
GELNSDSENE LGTIKSDISR NALLFAATQD ALHQPYRANL MKDSWKLVET LREHGFAAAI
SGAGSCVAVF YAGDAECNKS AVQKIDAIAE PWLSRAGWRV LHVQVDSIGV AITRE
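
The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the method used to produce this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard set of codon-to-amino-acid assignments, which translation table 11 shares for elongation codons (the tables differ in which codons may act as starts; this gene begins with ATG, so that difference does not matter here). The sequences are printed in space-separated groups of ten, so whitespace is stripped first.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; assumed to match
# translation table 11 for all non-start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAGCCAG TGTGTAACAG TGTTTCCGTG CGCGTGCCAG CGACTAGCGC AAATCTTGGT"
print(translate(first_line))  # MKPVCNSVSVRVPATSANLG
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.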