Gene HMPREF0424_1233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1233 
Symbol 
ID8709920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1466029 
End bp1467171 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content48% 
IMG OID646483321 
ProductD-ala D-ala ligase N-terminal domain protein 
Protein accessionYP_003374426 
Protein GI283783672 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000148623 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGAAAA AGCGTGTAGT GTTCTTGTAT GGCGGTAAGG CAGACGAACA TTCGATTTCT 
TGCATATCCG CTGCTGGAGT GCTCAGTGCT GTAGACGAAA ATCGTTTTGA CATTGTGCGG
ATCGCAATCA CAAAACAAGG CGAATGGATT GTTGGTGGTG AAGATCCTCG TAACTTCCAC
ATGGAAGAGG GTGAGTTGCC AGTAATCACA AAAACCGCTG GAACGCGCGA TGTTGTGCTT
GACCCTGCCA AAGGCGCAGA CGGATTTTTA GCCCGCGAAA ATGACGGTAC ACTTACAAGT
TTGGGTCATA TTGACGCAGT TTTCCCAGTT CTTCACGGTC CAAACGGTGA AGATGGCACT
TTGCAAGGCT TGCTAGAAAT GATGCAAGTT CCATACGTTG GTTGTGGAGT GTTGGCTTCG
GCTGCGTGCA TGGATAAATA CTATGCAAAG CAACTGTTTA AAGCTGCCGG AATTGATGTT
GCTCCTGGAA TTGCACTTGA TGCGCGCAAG TTTGCAAGCG ACGCGGAAAA TCGCTTTGAT
GCATACGCAG AAGAAATTCT TGCTCAAGTT GAAGCAGCAA AACTCGAGTA TCCGCTTTTT
GTAAAGCCAA GCCGCGCAGG ATCTAGCTTT GGAGTTACAA AAGTTGAATC GCGCGATGCT
AAAGCCTTAG CAGAAGCTAT TTTTGAGGCT TCAGAGCACG ATTGGCGAGT GTTAGTAGAG
CAGGGAATTG ACGCTCGAGA AATTGAGTGC GCGGTTCTTG CAGCTCGCGA CGGCGAAGAA
CCGAAAGCAA GCTGGCCTGG AGAAGTTGTG CTAGACAAGC GCGAGGCTGG TGAAGACCAG
TTCTACGATT TTGACAGCAA ATACATGGAT TCCGCAGCTT CGCACGTTGA AGTTCCAGCA
AACTTGCCAG CGGAAACTTT AGAGCGCGTT CGCAAAACGG CGCTTGCAGC GTTCAAAGCG
GCAGATGGGC GGGGACTAAG TCGCGTAGAT TCTTTTGTAA CTAAAGATGG TCGCGTAATT
CTTAACGAAA TCAACACGAT GCCTGGATTT ACGCCAATCT CCATGTATCC AAAAGCTTGG
GAAGCAACTG GAATAAGCTA CAGCGATTTA ATCACAAAGC TTATTGAAGG CGTTTTGAAA
TAA
 
Protein sequence
MTKKRVVFLY GGKADEHSIS CISAAGVLSA VDENRFDIVR IAITKQGEWI VGGEDPRNFH 
MEEGELPVIT KTAGTRDVVL DPAKGADGFL ARENDGTLTS LGHIDAVFPV LHGPNGEDGT
LQGLLEMMQV PYVGCGVLAS AACMDKYYAK QLFKAAGIDV APGIALDARK FASDAENRFD
AYAEEILAQV EAAKLEYPLF VKPSRAGSSF GVTKVESRDA KALAEAIFEA SEHDWRVLVE
QGIDAREIEC AVLAARDGEE PKASWPGEVV LDKREAGEDQ FYDFDSKYMD SAASHVEVPA
NLPAETLERV RKTALAAFKA ADGRGLSRVD SFVTKDGRVI LNEINTMPGF TPISMYPKAW
EATGISYSDL ITKLIEGVLK