Gene HMPREF0424_0911 details

Gene Information

Locus tag: HMPREF0424_0911
Symbol:
ID: 8708841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 1040112
End bp: 1041305
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 40%
IMG OID: 646483009
Product: shikimate dehydrogenase substrate binding domain protein
Protein accession: YP_003374125
Protein GI: 283783371
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0169] Shikimate 5-dehydrogenase
TIGRFAM ID:
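
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. The variable names are illustrative and not part of the record; the check assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields.
start_bp = 1040112
end_bp = 1041305

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints "1194 397"
```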


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAAAA CATATCATTG CGCAGTTTTG GGAGATCCTA TTTCGCATTC GCTGTCTCCT 
GTGCTTCACA ACACGGCTTA TAAGGTTTTA GGTCTTACAA ATTGGCAATA TAGTAAGGAA
AAAGTAAGCG AAACAGAGCT TCGCGATTTT ATTGCGCATC TTGACGATTC TTGGAAAGGC
TTAAGCCTTA CTATGCCGCT TAAAAAAACC GTAATGAAAC TCGGTACACC TTGCGACTAC
TGGTCGCGCG CGCTTAACGT GGCAAATACT GCAGTTTTTA CTAGCAACCC AGCAACCGCT
TCGCAATCGC AAATAAAGCT ATACAACACT GACGTAAGTG GCATATTAAT AACATTTATG
CAAGCATTAC ATGAGCGCAT ATGCGAAGTA AAAAAGGCTG TTATTATTGG CAGTGGCAAC
ACTGCGTCCT CTGCATTGGC AGCTTTAATT GAAATTGCGC AAGTAGCAAA GCTTGAGCAA
GTGCAAGTTG TTGCTAGAAC TGAAGATGAC GGTAGTGTAA AAGGCGTTGA ACGTTTTAAT
AATCTTATAC AAAAGTATTG CGCTGATTTT CCAAATAAAA TAAGCATAGG TCAAAATAAT
AAAAGCGAAG AAAATGACGC TATGTATGAG CCTGTGCAAG GTATTGAGTT CCCAACAATA
AGCGAAGAAG TATTTTCAAA ATTTGCAGAC TATAGCGCAG AAAAATTGAG TGCAGAAAAA
TTGTCAGAAT GTGATAAATC TCTCATTTTG AAATCTTTAA ATTCACTCGA AACTGTTCGT
GCAATCGCAG AAGCAGACAT AGTCATAAGC ACAGTTCCAG CGCATGTAGC AGATCCAATT
GCGCTTGCGT TGAAAGCGTA CTGTCAAGAT AGTGCGAATA ATCAAACGAA ATTAGGAACA
TTGCTTGACG TAGTTTACGA TCCTCGCCCA AGCATGCTAC TTAGTGTATG GCAACAATAC
GGTTGCGCTA TTGGCGGAGA AGAAATGCTT TTGCAACAAG CTTTAGCACA AGTAAGTCTT
ATGACGATTG AGTACAGGCA ATCTAAAGAA AATTTTGGGA ATTGTGCTAA GACTGGCGCT
GAGGATTGTG CTGAGACTGG TTCTGAGCTT GGTTATAAAC CTAGTGCTGA CAAGGCTAGA
TATTACGATG ACTTAAGTAG TCTCATGCGA AAAGCCTTAC AGGAGGCATT ATGA
 
Protein sequence
MLKTYHCAVL GDPISHSLSP VLHNTAYKVL GLTNWQYSKE KVSETELRDF IAHLDDSWKG 
LSLTMPLKKT VMKLGTPCDY WSRALNVANT AVFTSNPATA SQSQIKLYNT DVSGILITFM
QALHERICEV KKAVIIGSGN TASSALAALI EIAQVAKLEQ VQVVARTEDD GSVKGVERFN
NLIQKYCADF PNKISIGQNN KSEENDAMYE PVQGIEFPTI SEEVFSKFAD YSAEKLSAEK
LSECDKSLIL KSLNSLETVR AIAEADIVIS TVPAHVADPI ALALKAYCQD SANNQTKLGT
LLDVVYDPRP SMLLSVWQQY GCAIGGEEML LQQALAQVSL MTIEYRQSKE NFGNCAKTGA
EDCAETGSEL GYKPSADKAR YYDDLSSLMR KALQEAL
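
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: the function name `translate` is hypothetical, and the codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, with the page's 10-base grouping kept.
print(translate("ATGTTAAAAA CATATCATTG CGCAGTTTTG"))  # prints "MLKTYHCAVL"
```

Passing the full gene sequence, with or without the spacing shown on the page, should give the protein sequence listed above once the 10-residue grouping is removed.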