Gene HMPREF0424_1119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1119 
SymbolgalE 
ID8709176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1288465 
End bp1289475 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content46% 
IMG OID646483210 
ProductUDP-glucose 4-epimerase 
Protein accessionYP_003374320 
Protein GI283783566 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.368347 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTTC TTGTTACAGG TGGATGCGGA TACATTGGAG CACATGTAGT TCACGCTTTA 
CATGAAACAA AGCAAAATGT CGTCGTCGTT GACGATTTAA GCTACGGAAA GCCTACAAGA
ATTGAAGGCG CACGTCTTTA TGGTATGGAT ATCTCTTCTC CTGATGCAGG AAAGCGACTT
GCGCAAATTA TGAAAGATGA GCACGTCGAT GCTGTAATTC ACTTTGCTGC TAGAAAGCAA
GTTGGAGAGT CCGTAGAAAA GCCACTTTGG TATTATCAAC AGAATATTAA CGGCATGCTT
AATGTGCTTG AAGGCATGAA AGATAGCGGA GTTAAGAAGC TCGTATTTTC TTCTTCTGCA
GCAACTTATG GAGTACCTCC TGTTGAAGTT GTACCAGAAG ATGTTGTGCC AATGTTGCCA
ATTAACCCTT ATGGTCAAAC CAAGCTTTTT GGCGAGTGGA TGGCACGTGC TTGCGAGCAC
ACATACGGAA TTCGCTTCTG CGCACTGCGT TACTTCAACG TAGCTGGATG CGGTCCGGTG
GAATTGGAAG ATCCTGCGAT TTTGAACTTA ATTCCTATGC TTCTTGACAG ACTACAGCGA
GGAAAAGCAC CTGCTATTTT TGGCGACGAC TATCCTACTG CTGATGGCAC TTGCATTCGC
GACTATATTC ACGTTTCTGA TTTAGCAGAC GCGCATATTG CGGCACTCAC GTATTTGGAT
CGCGACGAGC GCAAGTACGA TGCTTTTAAC GTAGGAACTG GTAAGGGCAC GTCTGTGCGC
GAAATTGTTG ACGAAGTGCG CAGAGTTACC GGTTTACCAT TTAAAGAAAC TGTTCTTGAT
CGCAGAGCTG GAGATCCTCC ACAGCTTATT GGCAGCACGA AGCGAATTAA CGAAGAAATG
GGATGGCATG CTCGCTACGA CGTTAAGGAT ATCGTAGAAT CTGCTTGGGC AGCTTGGCAG
GCAAATCCTG AGCATCATAT TGATGTTGAT TCGTGGAAGC AGCTTGACTA G
 
Protein sequence
MTVLVTGGCG YIGAHVVHAL HETKQNVVVV DDLSYGKPTR IEGARLYGMD ISSPDAGKRL 
AQIMKDEHVD AVIHFAARKQ VGESVEKPLW YYQQNINGML NVLEGMKDSG VKKLVFSSSA
ATYGVPPVEV VPEDVVPMLP INPYGQTKLF GEWMARACEH TYGIRFCALR YFNVAGCGPV
ELEDPAILNL IPMLLDRLQR GKAPAIFGDD YPTADGTCIR DYIHVSDLAD AHIAALTYLD
RDERKYDAFN VGTGKGTSVR EIVDEVRRVT GLPFKETVLD RRAGDPPQLI GSTKRINEEM
GWHARYDVKD IVESAWAAWQ ANPEHHIDVD SWKQLD