Gene HMPREF0424_1332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1332 
Symbol 
ID8709333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1592662 
End bp1594581 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content50% 
IMG OID646483417 
Productalpha amylase, catalytic domain protein 
Protein accessionYP_003374515 
Protein GI283783761 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.86601 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACTT TTGACAGAGC AAACTTGCCA GAATCCGTGC GCACAAACGG CGCTACTCCA 
AACCCTTGGT GGGCGAATGC AGTTGTGTAT CAGATTTATC CACGAAGCTT CCAGGATACG
AACGGTGACG GCATTGGTGA TTTGAAGGGA ATTACTTCGC GCCTTGATTA TTTGGCTGAT
TTAGGCGTTG ACGTGCTCTG GCTCAGCCCT GTTTACAAGT CCCCACAAGA CGATAACGGC
TACGATATTT CCGATTATCA AGATATTGAT CCGCTTTTCG GAACTTTGGA AGATATGGAC
GAGCTGCTGG CTGGAGCTCA CGAGCGCGGT CTTAAAGTCG TGATGGATTT AGTAGTAAAT
CACACTTCTG ACGAGCACGC TTGGTTCCAG GCTTCAAGAG ATGCTTTTAG TGATTACGCT
GATTGGTATT GGTGGAGGCC TGCGCGTGAA GGTTGTGTGC CTGGTGAGCC GGGTGCGGAG
CCGAATAAGT GGGGTTCGTA TTTCGGCGGT TCCGCGTGGA CTTACGATCC GAAGCGCGGA
GAATACTTCC TTCACCAGTA TTCGCCAAAG CAGCCAGATT TGAATTGGGA AAATCCCGCG
GTTCGCGCTG CCGTGTACAA GATGATGAAT TGGTGGATGG ATCGCGGAAT TGATGGCTTC
CGCATGGATG TGATTACGCA AATTTCCAAG CATGTTGATT CTGAAGGTCG TTTGCCGGGT
GAAGATGGCT GCCAAATTGA AGATTTGCCA GCTGGTGCTG ACGGCTATTC TTCGCCATTC
CCGTTCTGCT CGGACGGTCC TCGTTTAGAC GAATTCTTGC GTGAGATGCG TGCAGAAGTT
TTCGACGGTC GCGAAGGTTA TCTTACTGTC GGCGAGGCTC CGGGAATTTC TCCGCACCGT
AATACTTATA TTACTGACCC TGCCCACAGC GAACTTGACA TGTTATTCCT GTTCAATCAT
GTGGATATCG ATTGCGAAAA CGGTACTAAG TGGAATCCTG TGCCGCTAAA GCTTACGGCA
TTAAAGAGCG TGATGGCAGA GCAGCAACAA GCGGTAGCGG ATGCTGGTTG GGCGAGCCTG
TTCTTTAATA ATCACGATCA GCCGCGTGCG CTTTCTCGCT GGGGTAGTGA GGCTAGCGAG
GAGATGCGCG TGCGCTCTGC TAAGGCGATT GCCATGCTTC TTCACATGCA CCGCGGCACG
CCTTACGTGT ACCAAGGCGA GGAAATTGGT ATGACTAACG CGCACTTTAC GCGTCTCGAG
CAGTATCGCG ATTTGGAAGC GTTGAATATG TTTAAGCAGC GCGTAGAAGA AGCGCATATT
CAGTCTGCTG AGTCGATGAT GGATGCGCTC GCAAAGCGCG GCCGCGACAA TTCTCGAACT
CCAATGCAGT GGAATGCTTC TAAATACGCG GGATTCATGC CTTTTAATGT TCAGTCGGTA
AATAACGCAG AGCCGTGGAT TAGTGTGAAT CCTAATTATG TGGATATTAA CGCAGCGGAG
CAGATGGAAG ATTCAGATTC CGTTTACGCT TTCTATAAGT ATTTGATTGC GCTTCGTCAT
TCGGAACCGA TTGTTTCTGC TGGTTCTTGG GATTTGGTTG ATGCAGATGA CGAGTGCGTG
TATGCTTTCG TGCGCGAGCT TAAATCTGGT TGCGAAAAGC AGGAAGATTC TGCAGAGACT
GCTGCGGAAT CTGCTGCAAA TTCCGCGGAA TCTACAGAAC GCATGCTCGT AATGGTTAAT
ATGACTGACT CAACTGTGCC TATCCCTACG CAGAGCGCGC AATTATTGCA AAATTGCGCC
TCTAACGCTT TTGTAGGGCG TGACGTAATG GTTACGACCT ACGACGTGAA TCACGCGCTT
ACATCGCTTA AAAACGGTAC GCTTGCGCCT TGGGAAGGTA TAGCTGTTAC TTTGCAATAG
 
Protein sequence
MTTFDRANLP ESVRTNGATP NPWWANAVVY QIYPRSFQDT NGDGIGDLKG ITSRLDYLAD 
LGVDVLWLSP VYKSPQDDNG YDISDYQDID PLFGTLEDMD ELLAGAHERG LKVVMDLVVN
HTSDEHAWFQ ASRDAFSDYA DWYWWRPARE GCVPGEPGAE PNKWGSYFGG SAWTYDPKRG
EYFLHQYSPK QPDLNWENPA VRAAVYKMMN WWMDRGIDGF RMDVITQISK HVDSEGRLPG
EDGCQIEDLP AGADGYSSPF PFCSDGPRLD EFLREMRAEV FDGREGYLTV GEAPGISPHR
NTYITDPAHS ELDMLFLFNH VDIDCENGTK WNPVPLKLTA LKSVMAEQQQ AVADAGWASL
FFNNHDQPRA LSRWGSEASE EMRVRSAKAI AMLLHMHRGT PYVYQGEEIG MTNAHFTRLE
QYRDLEALNM FKQRVEEAHI QSAESMMDAL AKRGRDNSRT PMQWNASKYA GFMPFNVQSV
NNAEPWISVN PNYVDINAAE QMEDSDSVYA FYKYLIALRH SEPIVSAGSW DLVDADDECV
YAFVRELKSG CEKQEDSAET AAESAANSAE STERMLVMVN MTDSTVPIPT QSAQLLQNCA
SNAFVGRDVM VTTYDVNHAL TSLKNGTLAP WEGIAVTLQ