Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_0678 |
Symbol | |
ID | 8709152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 765614 |
End bp | 766714 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 646482783 |
Product | HAD hydrolase, family IIA |
Protein accession | YP_003373906 |
Protein GI | 283783152 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGTTG AAACGAAAAA GAATTTTTCT TCATGTAATC GTCCGTTAAG CGATGCATTT CGTTTAGCAC TGCTTGATTT AGATGGTGTT GTATATCGTG GTGGAAATGC TGTTGAATAT GCGTCTGATT CCATTTTGTT TGCGCAAAAA AATGGAATGG CAATTGAATA CACAACTAAT AATTCTTCGC GTTTCCAATC TGTTGTTGCA AAACAACTGG AAAGTTTTGG CTTAAAAGTA GAACCATGGC AGATTATAAC GTCTTCTGTT GTTGCAGCAA GGATGGTAGC TCGCAATGTT GAGAAAGGAT CCAAAGTTCT TGTTCTCGGC GCTGAACATT TGCGTCAAGA AGTGCAACGC GTAGGATTAC AACTCGTAGA TTCGTGCGAA GATAATCCTA AAGCTGTTAT TCAAGGCTGG TATCCTCAAA TGACTTGGCA AGAAATGGCG GAAGTTTCTT TCGCTGTTGA GCATGGTGCA AAATATTTCG TTACTAACCG CGACTTAACT ATTCCAAGAG AGCATGGTAT TGCTCCTGGA TGCGGTTCCA TGATTCAAGC TGTTATTAAT GCTACGGGTG TAGAGCCTAT TTCGTCTGCT GGAAAACCAG AATCTGCAAT GTATGATGAA GCAAGGTTTT TGGTTGCTGC TAATGCTAAG CATGATGATT CTGAATGTGA AGAATACACT GAAAAAGACG AGTATGGGAA TCCTGTAATT AGTATTGAAC ATTCATTAGC AGTTGGAGAT CGTTTAGATA CTGATATTGA GGCTGGAACA AGAGGAGGCT ATGCTTCATT ACTTGTTCTT ACTGGAGTTA CAGATCCTCG CATGCTTATG CTTGCGCCAA AACATTTGCG TCCAAGTTTT GTTTCTAAAG ACTTGCGAGG ATTAAACGAA TCTCATAACG CTCCTGAACG CGTTAACGGT AGTACTTTTA CTTGTGAAGA TGCTATAGCA AGAGTTGTGA ATAATAATAT TATTGAAGTT AATAACACTA ACGATTGCAA TGCTTTAAGA GCTGCTTGCG CGCTGGCATG GAGCTTGCAA GATTGCGGTG AAAATATGGA AAATTATACT CTTCCGGAGT TTTCCTTATG A
|
Protein sequence | MQVETKKNFS SCNRPLSDAF RLALLDLDGV VYRGGNAVEY ASDSILFAQK NGMAIEYTTN NSSRFQSVVA KQLESFGLKV EPWQIITSSV VAARMVARNV EKGSKVLVLG AEHLRQEVQR VGLQLVDSCE DNPKAVIQGW YPQMTWQEMA EVSFAVEHGA KYFVTNRDLT IPREHGIAPG CGSMIQAVIN ATGVEPISSA GKPESAMYDE ARFLVAANAK HDDSECEEYT EKDEYGNPVI SIEHSLAVGD RLDTDIEAGT RGGYASLLVL TGVTDPRMLM LAPKHLRPSF VSKDLRGLNE SHNAPERVNG STFTCEDAIA RVVNNNIIEV NNTNDCNALR AACALAWSLQ DCGENMENYT LPEFSL
|
| |