Gene HMPREF0424_0862 details

Gene Information

Locus tag: HMPREF0424_0862
Symbol:
ID: 8710023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 975089
End bp: 976588
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 42%
IMG OID: 646482962
Product: CBS domain protein
Protein accession: YP_003374079
Protein GI: 283783325
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
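
The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(975089, 976588)
print(length_bp)                  # 1500
print(protein_length(length_bp))  # 499
```

Both values match the Gene Length and Protein Length fields of this record.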


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATATA TTAAACGAAT ATTTTCTAGC ATATCAGTTC CGCAATTGCC GTTGCATGTC 
TGGATTGGTT TAGCCGCATT GCTTGTGCTG GCAATGCTGC TTGTATGGCT TTCGCTTGCG
ATGGCTGCTG CAGAAGGTGC AGTGCCTAGA GTTACTCGTG CAAGCTTAAA TAATCTTATG
ATTGAAGCGC AAACGCATGA TGATGAAACT AGTGCCATAG TGCGTATGCG TCGAATTGCT
AGAATCCATC GTGTAAAAAA ACTGATTGCT GACCGTTATA CAACTTCTGG AGCATGCGCT
TTTGTGCGTG TGCTATGTAA CGTTTTAGAT GGAGTTTTGT TTGCTTCTTT AGCTGCATTT
TTCGGCTCGC CTTTATGGGT AACTCTTATT ATTGGTGTTG TTGTTTCAGT AGTTGTAGCT
GTGATTTCTA TTCTTGTTCG CCCACGTTCG GTTGGTGCGG CTCAACCTGT AGCAAAGTTG
ATGCGGCTTT CTCGCATTGT GTCTATTGCT ATTGCGATTA ATCCTTTAGT GCATATTGCT
AGCGATGTTG ACGTCACTTC TAGGAAAAAG CATGTTGATC CGTCAGATGA TGAAGCTTTG
GAAGATATTC AGCTGGATCA AGCAAAGGCT AGTATTGACC GACTTGTTGA AGCGAATGAT
TTCGATCCGG AAGTTTCGGA AATGATGCGC AATGTGCTCA CGCTTTCAGA TACTTTGACT
CGCGAAATAA TGGTACCTAG AACCGATATG ATTTGCGTAA AAAGCGACGA AACGCTGGAA
AATTTCCTTA AATTATGCTC TCGCTCAGGA TTTTCTCGAG TTCCCGTCAT TGGAGATTCT
GTAGATGATT TAGTAGGTGT TGCTTATTTG AAGGATGCTG TACGCGCTAC TGTGTTTAAT
CCTGCAGCAT CTTCTAGAGC AGTTGAAACA ATTAGCCGAG ACCCTATGCT TGTTCCTGAG
TCGAAGCCGG TTGACGATTT ATTCCACGAA ATGCAGCGAA TTCGCCAGCA TGTGGCAGTA
GTGGTTGACG AATATGGTGG AATTGCTGGT TTGGTTACTA TTGAGGATGC TATTGAGCAA
ATTGTTGGCG AGTTGGAAGA TGAGCACGAT CGCACTCAGC ATGCAGATCC GGAAGAAATA
CGTGACGGCG TGTGGAAGAT GCCTGCTCGA ACGTCAATCG CAGATTTGGA AGATATTTTT
GAAGTTCATA TTGACGAAGA CGATGTGGAT ACTGTGTTTG GTTTGCTTAC GAAGTTGATT
GGTAATGTGC CGATTGTTGG TTCCAGTGCT ATTACAAGAG GACTAAAATT AACTGCTGTA
GATTCAGCTG GTAGGCGAAA GAAAGTATCT ACTATTTTAG TAGAACGAGA TGCGCAAGTT
TTGGAAGAGT CTAAGGATAA TCAAAATGAC ATTAGCTCTT CCGAAAAATC TGCTGAAAGC
AAACAAACCA ATGCTTTGAA GCAAGCTAGT ACTATGAAAC AAACGAATAT GCTGAAATAA
 
Protein sequence
MEYIKRIFSS ISVPQLPLHV WIGLAALLVL AMLLVWLSLA MAAAEGAVPR VTRASLNNLM 
IEAQTHDDET SAIVRMRRIA RIHRVKKLIA DRYTTSGACA FVRVLCNVLD GVLFASLAAF
FGSPLWVTLI IGVVVSVVVA VISILVRPRS VGAAQPVAKL MRLSRIVSIA IAINPLVHIA
SDVDVTSRKK HVDPSDDEAL EDIQLDQAKA SIDRLVEAND FDPEVSEMMR NVLTLSDTLT
REIMVPRTDM ICVKSDETLE NFLKLCSRSG FSRVPVIGDS VDDLVGVAYL KDAVRATVFN
PAASSRAVET ISRDPMLVPE SKPVDDLFHE MQRIRQHVAV VVDEYGGIAG LVTIEDAIEQ
IVGELEDEHD RTQHADPEEI RDGVWKMPAR TSIADLEDIF EVHIDEDDVD TVFGLLTKLI
GNVPIVGSSA ITRGLKLTAV DSAGRRKKVS TILVERDAQV LEESKDNQND ISSSEKSAES
KQTNALKQAS TMKQTNMLK
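
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that does not matter here). The helper name translate is illustrative, and a library such as Biopython would normally be used instead.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_30 = "ATGGAATATA TTAAACGAAT ATTTTCTAGC"
print(translate(first_30))  # MEYIKRIFSS
```

The output matches the first ten residues of the protein sequence listed above.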