Gene HMPREF0424_0678 details

Gene Information

Locus tag: HMPREF0424_0678
Symbol:
ID: 8709152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 765614
End bp: 766714
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 40%
IMG OID: 646482783
Product: HAD hydrolase, family IIA
Protein accession: YP_003373906
Protein GI: 283783152
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0647] Predicted sugar phosphatases of the HAD superfamily
TIGRFAM ID: [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA
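
The start, end, gene length, and protein length fields can be cross-checked against one another. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the record does not state these conventions, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 765614
END_BP = 766714


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS, assuming it ends with one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1101
print(protein_length(length_bp))  # 366
```

Under these assumptions both results agree with the Gene Length (1101 bp) and Protein Length (366 aa) values listed in the record.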


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGTTG AAACGAAAAA GAATTTTTCT TCATGTAATC GTCCGTTAAG CGATGCATTT 
CGTTTAGCAC TGCTTGATTT AGATGGTGTT GTATATCGTG GTGGAAATGC TGTTGAATAT
GCGTCTGATT CCATTTTGTT TGCGCAAAAA AATGGAATGG CAATTGAATA CACAACTAAT
AATTCTTCGC GTTTCCAATC TGTTGTTGCA AAACAACTGG AAAGTTTTGG CTTAAAAGTA
GAACCATGGC AGATTATAAC GTCTTCTGTT GTTGCAGCAA GGATGGTAGC TCGCAATGTT
GAGAAAGGAT CCAAAGTTCT TGTTCTCGGC GCTGAACATT TGCGTCAAGA AGTGCAACGC
GTAGGATTAC AACTCGTAGA TTCGTGCGAA GATAATCCTA AAGCTGTTAT TCAAGGCTGG
TATCCTCAAA TGACTTGGCA AGAAATGGCG GAAGTTTCTT TCGCTGTTGA GCATGGTGCA
AAATATTTCG TTACTAACCG CGACTTAACT ATTCCAAGAG AGCATGGTAT TGCTCCTGGA
TGCGGTTCCA TGATTCAAGC TGTTATTAAT GCTACGGGTG TAGAGCCTAT TTCGTCTGCT
GGAAAACCAG AATCTGCAAT GTATGATGAA GCAAGGTTTT TGGTTGCTGC TAATGCTAAG
CATGATGATT CTGAATGTGA AGAATACACT GAAAAAGACG AGTATGGGAA TCCTGTAATT
AGTATTGAAC ATTCATTAGC AGTTGGAGAT CGTTTAGATA CTGATATTGA GGCTGGAACA
AGAGGAGGCT ATGCTTCATT ACTTGTTCTT ACTGGAGTTA CAGATCCTCG CATGCTTATG
CTTGCGCCAA AACATTTGCG TCCAAGTTTT GTTTCTAAAG ACTTGCGAGG ATTAAACGAA
TCTCATAACG CTCCTGAACG CGTTAACGGT AGTACTTTTA CTTGTGAAGA TGCTATAGCA
AGAGTTGTGA ATAATAATAT TATTGAAGTT AATAACACTA ACGATTGCAA TGCTTTAAGA
GCTGCTTGCG CGCTGGCATG GAGCTTGCAA GATTGCGGTG AAAATATGGA AAATTATACT
CTTCCGGAGT TTTCCTTATG A
 
Protein sequence
MQVETKKNFS SCNRPLSDAF RLALLDLDGV VYRGGNAVEY ASDSILFAQK NGMAIEYTTN 
NSSRFQSVVA KQLESFGLKV EPWQIITSSV VAARMVARNV EKGSKVLVLG AEHLRQEVQR
VGLQLVDSCE DNPKAVIQGW YPQMTWQEMA EVSFAVEHGA KYFVTNRDLT IPREHGIAPG
CGSMIQAVIN ATGVEPISSA GKPESAMYDE ARFLVAANAK HDDSECEEYT EKDEYGNPVI
SIEHSLAVGD RLDTDIEAGT RGGYASLLVL TGVTDPRMLM LAPKHLRPSF VSKDLRGLNE
SHNAPERVNG STFTCEDAIA RVVNNNIIEV NNTNDCNALR AACALAWSLQ DCGENMENYT
LPEFSL
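
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the tool used to generate this record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons, which does not matter here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Only the first and last lines of the gene sequence are used as input to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from this record.
FIRST_LINE = "ATGCAAGTTG AAACGAAAAA GAATTTTTCT TCATGTAATC GTCCGTTAAG CGATGCATTT"
LAST_LINE = "CTTCCGGAGT TTTCCTTATG A"

print(translate(FIRST_LINE))  # MQVETKKNFSSCNRPLSDAF
print(translate(LAST_LINE))   # LPEFSL
```

The two outputs match the first 20 and the last 6 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the listed GC content to be checked the same way.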