Gene HMPREF0424_1181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1181 
Symbol 
ID8709023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1384114 
End bp1385853 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content37% 
IMG OID646483271 
Productglycosyltransferase, group 1 family protein 
Protein accessionYP_003374378 
Protein GI283783624 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTCT TTAATCTAGA TGTTAATGCA AAAGCTTATA TAGACTCTTT AAGCGAGCGT 
CAACGCAGGC TTATTGTGGG TGCTATAAAT TATCATTCCC TAACAGCTCC ATCTAGAACT
TTGCAATATC AAGTTTACAT GGGTCAAAAA TGGCAGTATC CGCGATTGGC TGATTTTTCA
GTTTTTGACG GTGCTTGTAC AAAAAACGAA GGATTAAAAC CAATCGCAAC TTTTAGTGCA
CGATCAACTG TTAAAACTAA TAGCGAATAC ACAAAAGAAT TTTGGAATGA AGGCAAACTG
TTTAAAGATA ATGGCGTTAG GTATTTGTTG CTGACACCAG ATGATGCAAC CGACGATGAT
GTGATGTCTC ACTGGGCTTC TGATACGTTA AAGCAGCACA TTTACAAAGA GTGTGAATTG
CACGCCCCTT CTATGTTTGA TGTTGTTAGG GAAGTAATGC GCAAAAAACG CGAGTGTGAT
TCGAAGAAAC CTGTTGAAGA GTATTGTCCT TGGAATAAAA AGAAGTTTTA TTATGTTGCA
AATGCTCACC AAAGAGAAAT TGCAAAACAA GAAAATTGGG ATTTATCAAA TATAAAAGAT
GCTAGATTGG ACTGTGTTTC TCCTGCTAAA AGAGTTGAAG GAGCTAAAAA AGCTATTATT
ATAGGTTTGC ATTGGCTGCA AGCTGGTGGT GCAGAAAGAT GGGCTGTAGA AACAATAAAA
CTTGCCAAAA AAGCTGGTTT TGTGCCAATT GTTATCACAG ATAGAGACGG TCACCAGCCG
TGGATTACGA AAGATTTCTG TGATGATGCT CTTGTGTTAC CACTTACTCC ACCAATTAGA
GAATATTGGG CGGATGTTCC ACTTTTAAGA TCTTTGTTTG AGCAATTTGA GATTAAAGCA
ATAGCAATAC ACCATTGCCA ATGGCTTTAC GACCATATTT GGTGGGTTAA ACAATTCTTC
CCAGAAACAC ATATTATTGA TTCGCTGCAT ATTGTTGAAT ACATAAATTG TGGAGGATAT
CCGCATGAAG CTGTAACACA CGATAAGTGG ATTGATTTGC ACCATGTTAT TTCTCCACAA
CTTGAAAATT GGTTGCATGA TGTGCATGGT ATTGAATCTT CCAAGATTGT TGACGCTCCA
CTTATTGGTC TTACTGCTGG AATTGGAAAC GATTGCGTAT CGCCGAGAAA AAATAAAAAT
GTTCTGAATG TTGCTTTTGT TGGAAGGATT GCAAGACAAA AACGTCCAGA AGCGTTTGTT
CTTGTTGCAA AAGCTTTAAA CAAAAAATAT CCAGGGAAAT TCCACTTTAT TTTGCATGGA
GATGGCGAAT TAGACACATT TGTTTCAGAA CTTATATCTA GGTATGGTCT TGAAAACGTT
ATTGAGCGCA GATCTATGAA AATTCCAGCT CAAAAAACGT ATGATGATGC GGACGTGCTA
CTTATTAGCT CTGTAAACGA GGGTATTACG CTTACAACTA TTGAAGCACT TTCTAATGGA
GTTCCAGTAA TTTCATCGGA TGTTGGATCG CAAAAAACTG TTATAGCAAA AAGTGCGTTA
TTGCCAAGAA TGACTTCTAG TTTTGTTAAA GCGTCTGTAA AGGAATTGTT TAATATTATG
GAGAATGATT CATATCGTTC ATTATTGTTA AAAGAAGAAG TTGAAAAATT GCGCAATTTC
GCAAAACTCG AATCTGCAGA AACGTTATTT ACTAAGTTAT TTGAAGAATG GAGCAAGTAA
 
Protein sequence
MNFFNLDVNA KAYIDSLSER QRRLIVGAIN YHSLTAPSRT LQYQVYMGQK WQYPRLADFS 
VFDGACTKNE GLKPIATFSA RSTVKTNSEY TKEFWNEGKL FKDNGVRYLL LTPDDATDDD
VMSHWASDTL KQHIYKECEL HAPSMFDVVR EVMRKKRECD SKKPVEEYCP WNKKKFYYVA
NAHQREIAKQ ENWDLSNIKD ARLDCVSPAK RVEGAKKAII IGLHWLQAGG AERWAVETIK
LAKKAGFVPI VITDRDGHQP WITKDFCDDA LVLPLTPPIR EYWADVPLLR SLFEQFEIKA
IAIHHCQWLY DHIWWVKQFF PETHIIDSLH IVEYINCGGY PHEAVTHDKW IDLHHVISPQ
LENWLHDVHG IESSKIVDAP LIGLTAGIGN DCVSPRKNKN VLNVAFVGRI ARQKRPEAFV
LVAKALNKKY PGKFHFILHG DGELDTFVSE LISRYGLENV IERRSMKIPA QKTYDDADVL
LISSVNEGIT LTTIEALSNG VPVISSDVGS QKTVIAKSAL LPRMTSSFVK ASVKELFNIM
ENDSYRSLLL KEEVEKLRNF AKLESAETLF TKLFEEWSK