Gene HMPREF0424_1316 details

Gene Information

Locus tag: HMPREF0424_1316
Symbol:
ID: 8709118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 1572231
End bp: 1573502
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 45%
IMG OID: 646483403
Product: extracellular solute-binding protein
Protein accession: YP_003374504
Protein GI: 283783750
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
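
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS length includes a terminal stop codon; neither convention is stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1572231, 1573502)
print(length)                     # 1272
print(protein_length_aa(length))  # 423
```

Both results match the Gene Length and Protein Length fields of this record.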


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAC TTACAAAAAT ATGTGCACTG ATTGGTGCTG CTGCAATGAT TATTAGCGTA 
AGTGCGTGCG GTAGTACAAA ATCGTCCGAC GCTAATGGTG CAACATCGCT TACCATTTGG
CATTATTGGG ATGGTGCTAA TGCTGATACC TTTGATGCAA TGGTGAAGGA TTTTAATGCT
TCGCATAAAA ATATTAAGAT TAAGACTGCG AGCGTCCCGA ATTCTGATTT TATGACGAAG
CTTCGTGCAT CGGCTTCGTC GAAGAGTTTG CCAGATATTT CTATAAGTGA TTTGGTATGG
GTGCCACAGA TTGCAAAAAT GGGCAATTTG ACTGATCTTT CTAAGGTTAT CAGCTCGAAA
ACGCTCGATG ATGTTACTCC TGCATTGATT GACTATGGTC ATATTGACGG CAAACAAGTT
TCTGTGCCAG TGACTGCCAA TAATCTTGCG TACATGTACA ACAAGGATGT TTATAAAGAA
GCTGGGTTAG ATCCTAACAA GCCACCGCAA ACATGGGATG AGTTGAAGAA AGTTGCCAAG
ACAATCAAGG AAAAGACGGG CAAGCCAGGG TATGATTTGC TTACTCAAGC AGGAGATAAC
GGCGAAGGTT TAACTTGGAA CTTCCAGGTC AACTTGTGGC AAGCTGGTGG CGAATTCTTG
ACGAAGGATA ATTCTAAGGC TGCATTCAAT ACGCCAGAAG GCAAGAAGGC TATGAACTTC
TGGATGGATC TTATCAAGAG CGGCGTGAGC CCATATGCTA AGTGGGGCGA ATTTGAAAAG
GGCAAGGGTG GTTCTGCTCA GGAAGGTAGC TGGATGGTTG GCATCTGGGC GCCAGATCCA
CCATTTGATT TCGGTGTAGC AAAAGCCCCT CATCCAAAGG ATGGTAAGGA GGCAACCAAT
CTCGGTGGTG AACAAGCAAT CGTCTTCCAC AATTCTGACG CTCGTGCCAA GGCTGCTGGC
GAGTTCTTGA ATTGGTTCTT GCAGCCAGAA CAGGTGATCA AGTGGTCGCA AAAGACTGGC
ATGCTTCCTG TAACGAAGAG TGTTGCAAAG TCTGATAAGT ATTTGGATTG GGTTAAGAAG
GAACAGCCTC GTTTAATTCC GTTTGTGGAA CAAATGGAGA TTGCTCATAC ACGTCCAAAT
ACGCCATTGT ATCCAAAGAT TTCCTTCGAA TTTGCAAAGG CTGTGGAGAA GGCTTTCGCT
GGAGAGCAGA GTGTTGACGA AGCGCTTGCG AATGCTGAAA AGGCAGTAAA CGACGTGATT
GCCAAAGGCT GA
 
Protein sequence
MKKLTKICAL IGAAAMIISV SACGSTKSSD ANGATSLTIW HYWDGANADT FDAMVKDFNA 
SHKNIKIKTA SVPNSDFMTK LRASASSKSL PDISISDLVW VPQIAKMGNL TDLSKVISSK
TLDDVTPALI DYGHIDGKQV SVPVTANNLA YMYNKDVYKE AGLDPNKPPQ TWDELKKVAK
TIKEKTGKPG YDLLTQAGDN GEGLTWNFQV NLWQAGGEFL TKDNSKAAFN TPEGKKAMNF
WMDLIKSGVS PYAKWGEFEK GKGGSAQEGS WMVGIWAPDP PFDFGVAKAP HPKDGKEATN
LGGEQAIVFH NSDARAKAAG EFLNWFLQPE QVIKWSQKTG MLPVTKSVAK SDKYLDWVKK
EQPRLIPFVE QMEIAHTRPN TPLYPKISFE FAKAVEKAFA GEQSVDEALA NAEKAVNDVI
AKG
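
The relationship between the two sequence blocks can be illustrated with a minimal sketch that strips the 10-character grouping and translates codon by codon. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons); the helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of the source database.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# amino-acid assignments with the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGAAGAAAC TTACAAAAAT ATGTGCACTG ATTGGTGCTG CTGCAATGAT TATTAGCGTA"
print(translate(clean(first_line)))  # MKKLTKICALIGAAAMIISV
```

The printed peptide matches the first 20 residues of the protein sequence above. Applying `clean` and `translate` to the full gene sequence, and `gc_fraction` to compare against the GC content field, follows the same pattern.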