Gene HMPREF0424_0218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0218 
Symbol 
ID8709802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp244375 
End bp245556 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content50% 
IMG OID646482337 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionYP_003373479 
Protein GI283782725 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCTA CAGCTGAGCA GCGTACTTCT CGCGCGGTAA AGAAGGAAGT TGAGGAGGGG 
CGTAACCCTC TTAATTCTGC GAAGTTTCAG CGCTGGGAAG ACCAGGTTGG TGTGGATCGC
ATAATAAACC GCCGCGTGCT TGAGCTGGAG CCTTTGCCAA CTCCTGCTCA GGTGCTTGGT
GAGTTGCCAT TGTCGCCGTA TGCGCAGGAT ATTGTTGCGC AGTCGCGCGA TGAGATTCGT
AGCTGTTTGA ATGGCGAAGA TGATCGCTTG TTGGTTATTG TTGGCCCGTG TTCTGTGCAT
GATCCTAAGG CTGCTCTTGA TTATGCGAGT CGTTTAGCTA AGTTGCGTTC GGAGCTTGAA
GATGAGCTTC TTATTGTTAT GCGCGTGTAT TTTGAAAAGC CGCGCACAGC TCTTGGCTGG
AAGGGTTTGA TTAACGACCC GGATATTGAC GGAACGCATG ATATTCATAA GGGTTTGTTG
CTTGCGCGTA AGACTTTGCT CGGCGTGTTG GATGCTGGTT TGGCTGCTGC TACTGAATTT
TTGGAGCCGA CCAGCCCTCA GTTTATTGCG GATGCTGTGA GCTGGGGTGC GATTGGTGCG
CGTAATACTG AGTCTCAGAT TCATCGTCAG CTTGCTAGTG GTCTTTCTAT GCCGGTTGGT
TTTAAGAATG CTACGGATGG TTCCGTTTCT GCGGCTGTAA ATGGCTGCTT AGCTGCGAGC
CAGCATCATA CGTTCTTTGG TATTGATCAT CTTGGTCGCG CTTGTGCTGT GCAGACTCTT
GGCAACCCTG ATTGCCATGT GGTTTTGCGC GGTTCTAGTA AGGGTCCTAA TTATGATGCT
GCGTCGGTTG CTTCTGCTGT TGCTTCGATT CGTTTGCGCT TGGGCACTGG TTGTATGGCT
TCTAACGGCG TGGTTGTGGA TTGTTCGCAT GGTAATTCTG GCAAGGATGA GCGTCGCCAG
ATTGAGGTTG TGCGTGAGAT TGCAGATCGT TTGGCTGATG GCGAGCAGGG TATTAAGGGC
GTTATGATGG AGAGCTTCTT GCAGGGCGGC AATCAAGATC CTGCTCCTCT TAGCGAGCTT
GAATACGGCA AGTCTATTAC GGATCGTTGC ATTTCTTGGG AAGATACTGC CAATTTGCTA
AGAATTTTAG CTAAATCTGT AGCAACTCGC CGTAGAGCTT AG
 
Protein sequence
MASTAEQRTS RAVKKEVEEG RNPLNSAKFQ RWEDQVGVDR IINRRVLELE PLPTPAQVLG 
ELPLSPYAQD IVAQSRDEIR SCLNGEDDRL LVIVGPCSVH DPKAALDYAS RLAKLRSELE
DELLIVMRVY FEKPRTALGW KGLINDPDID GTHDIHKGLL LARKTLLGVL DAGLAAATEF
LEPTSPQFIA DAVSWGAIGA RNTESQIHRQ LASGLSMPVG FKNATDGSVS AAVNGCLAAS
QHHTFFGIDH LGRACAVQTL GNPDCHVVLR GSSKGPNYDA ASVASAVASI RLRLGTGCMA
SNGVVVDCSH GNSGKDERRQ IEVVREIADR LADGEQGIKG VMMESFLQGG NQDPAPLSEL
EYGKSITDRC ISWEDTANLL RILAKSVATR RRA