Gene HMPREF0424_1064 details

Gene Information

Locus tag: HMPREF0424_1064
Symbol:
ID: 8709615
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 1214195
End bp: 1215637
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 41%
IMG OID: 646483156
Product: extracellular solute-binding protein
Protein accession: YP_003374267
Protein GI: 283783513
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
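
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the protein length excludes the terminal stop codon. Both assumptions match the reported values but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1214195
end_bp = 1215637

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1443, matching "Gene Length"
print(protein_length)  # 480, matching "Protein Length"
```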


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.075314
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAATC TTATGTCTGC ATTTGCAAGA ATTTTGCAAC AACGCAATCA TTTTTCGCGC 
TTGCTTCTGT CGATGCTTTG TGTTTTTTCT TTGCTATTTT CTTGCGGATG CGGAACTAAA
GATTCTAGAA CTCAAATTAG TGTTTGGTCT TGGGAACCAA GCATGAAGCG ACTTGCTGAA
GAATTTGAAC TACATAATCC GGATGTTCAT GTAACTGTAA AGGATACTAG CGGATATAGC
AATCTTAATA GCGCTATTCA AGATGGCTAT GGTATGCCAG ATGTTGTTCA ATTAGAATAT
TTTGCTCTTC CGCAATATGC GGTAAGTGGA CAGCTTTTAG ATATTACTGA TCGTGTGAAA
AATACTCGCA CTTTCTATAC TCCTGGAACA TGGTCTTCTG TGCAACTTGG TGGACGTGTT
TACGGTTTAC CAATGGATTC TGGCCCAATG GCTTGGTTTT ATAATGATGA CGTGTTTAAG
CAGGCTGGTG TAGATGCCAC AAAAATACAC ACTTGGGAAG ATTACCGTCA TGCTGCTCGT
AAGCTTAAAG ATATCGGTGT GTATATTGCA GCAGACTCAG GTGACGCAAG TTTTTATAAC
GCTATGATTT GGCTTGCAGG TGGACACCCG TTTATGACTT CTCATGATGG GAAAACAGTT
ACTGTTAGAT TGAGTAAAGA TAAGGGTACT GAAGAGTTCA CAAAGTTCTG GCAATCAATG
ATTGATGAAG GATTGATTGA CATTAGAACT AGAACTTGGA GTCAACGTTG GAAGAACGGT
GTTGGAGCTG GCAAGATTGC TTCTGTTTTC TCAGGAGCCT GGATGCCGTC TTTGCTGCTG
GAAAATGTGC CTGGAACTGC AGGATTATGG AAGGTAACGA ATGTGCCAAC TATGCACGGA
GAAAAGCGTA ATGCAGAAAT GGGCGGTTCT TCGTTATCTG TGCTTAAATC GAGTCGTAAG
CCTGAAGCAG CAATGCGATT TGTGAATTTT GTATGCCATG ATATGCATGG AATTCGTACT
CGTGTCAATG GTGGAGCGTT TCCTGCAGAT GTTGTTACGC TTAGAGATAA GTCATTTTTA
GATAGAGCTA CTATTCGTGA TTCACGAGGT ATTGATATAC CCTATTTTGG CGGCCAGAAG
TTTAATCGTG TATTTGCAGA TGCCGCTAAT CGCGTGGATA CGGGATACAG GTATTTGCCT
TTTGAAGTGT ATTCGCGAAG TGATTTTAGA GCAACTATGG GTCAAGCATA TGACTGGAGT
GTTAAATCTT TAGCTAGATT AAACGTGCAA GCCATGATTG ATGCTGGTGT TACTCAAGAT
GATGGAAGTA AATTGTGGCT TCCAGATGAT CCTGGTAAGC GAATTTCTTT AAAAGATGGT
CTTTTATTGT GGGAAAAAGA TCTTCAAGAA TATGGTTACA ATCAAGGATT TGTAGTTAGA
TAA
 
Protein sequence
MANLMSAFAR ILQQRNHFSR LLLSMLCVFS LLFSCGCGTK DSRTQISVWS WEPSMKRLAE 
EFELHNPDVH VTVKDTSGYS NLNSAIQDGY GMPDVVQLEY FALPQYAVSG QLLDITDRVK
NTRTFYTPGT WSSVQLGGRV YGLPMDSGPM AWFYNDDVFK QAGVDATKIH TWEDYRHAAR
KLKDIGVYIA ADSGDASFYN AMIWLAGGHP FMTSHDGKTV TVRLSKDKGT EEFTKFWQSM
IDEGLIDIRT RTWSQRWKNG VGAGKIASVF SGAWMPSLLL ENVPGTAGLW KVTNVPTMHG
EKRNAEMGGS SLSVLKSSRK PEAAMRFVNF VCHDMHGIRT RVNGGAFPAD VVTLRDKSFL
DRATIRDSRG IDIPYFGGQK FNRVFADAAN RVDTGYRYLP FEVYSRSDFR ATMGQAYDWS
VKSLARLNVQ AMIDAGVTQD DGSKLWLPDD PGKRISLKDG LLLWEKDLQE YGYNQGFVVR
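
The gene and protein sequences above can be cross-checked by translating the DNA with translation table 11, which assigns the same amino acids to codons as the standard genetic code. The following is a minimal sketch, not the database's own method: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 60-base line of the gene sequence is used as sample input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Table 11 uses the
# same codon-to-amino-acid assignments as the standard code; "*" marks stops.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence_text):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(sequence_text.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence as listed above.
first_line = clean(
    "ATGGCAAATC TTATGTCTGC ATTTGCAAGA ATTTTGCAAC AACGCAATCA TTTTTCGCGC"
)
print(translate(first_line))  # MANLMSAFARILQQRNHFSR
```

The printed string matches the first 20 residues of the protein sequence. Passing the full gene sequence through `clean` and `translate` should reproduce the whole protein under the same assumptions, and `gc_fraction` can be applied to it to compare against the reported GC content.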