Gene HMPREF0424_0438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0438 
Symbol 
ID8709095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp479652 
End bp481361 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content41% 
IMG OID646482553 
Productextracellular solute-binding protein 
Protein accessionYP_003373683 
Protein GI283782929 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000302418 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCATTT TCAAAAATAA TACGGCTTCA AAGAAAATTC TTGCTTCAAC AGCAGCTTTT 
TCACTACTTA CAATTTCACT ACTTTCGCTT TCAGCTTGCA CAAATTACAA CTCTGGGAAA
ACCAGTATTA CGAACACTTC CGTGCTAAAT TCTAATCCTA AAAAAGGCGG AACTTTTACA
ATATTTACGT CAAATACGAA TATGAATTTC GATCCAGCGC GCAGTCAGGG TTTGCCAATT
ACGTCAAATA ATTTCATCTT CCGCGCGCTT ACTACTTGGA AAGTAAATCC GGATTTAAGC
AAGCAAACGC GTGTAGTTCC AGATTTGGCA ACGGATACTG GCACAACAAG TGACGGCGGA
AAAACTTGGA AATACACTCT TAAAAAAGGT GTTAAGTATG AAGATGGAAC CGAAATCACT
TCGCACGATA TTAAATTTGG TATTGAGCGC TCGTTTGCAG ATTCGTTAAG CGGTGGTTTT
GGATATCACA AAACTCTTCT TGTTGGAGCT GAAAACTATA GAGGACCATT CGACAGAAAG
TCGCTTGATT CGATTGAAAC TCCAGATAAT CAAACGATTA TTTTCCATTT GAAAGCACCT
TTTGCAGATT GGCCGTGGGT GACATCTTTG GCTGCATTTG TGCCAGTTCC AACTAGTTCA
GGCGATGCTC AGACATATAG TAAAAGGCCG AAATCTTCAG GACCATACAG GATTGTGCAG
AACGAAGTCG GAAAGCAAGT TGTAATGCAA CGCAATAAGT ATTGGGATTC CAAGCTCGAC
AGTACTCGCA CAGCTTCTGC AGATGAAATT GTGTGGAAAC TTGGTTCAGA TACTACTGTT
TCAGCTCAGT CTATGATTCA AGGAAATACG GATACAAAAA CCGCATTTTT GGCTGATTTT
GTGCCGCCTG CTCAGCTTGC ACAAGCTCAA GCGCATCCTC AATCTCGCAA GTTGCTAACT
ACTAGTAGTG ATGGTGCTCT TGAATATTTG GCTATTAATA CGCGCAGAAT AACGGATATC
AATGTTCGAA AAGCGATTCA GTATGCTGTT GATAAGCAGT CGTATCGAAC TGCGAAGGGT
GGCGAAATTG CTGGAGGTTT TGCTACTACG CTTATTACGC CTGGTATTTC TGGTCGAAAG
CAGTTTAATT TGTATTCTGA AGATCCTCGA GGGAACGTTG AGAAAGCTAA ACAACTGTTG
AAAAAATCAG GTAAAACCGA TATTAAGTTG ATTCTGATTG CTCGTCCGGA TCAAACGCAA
GTGGCATCTT CTGTGCAATC AAGTTTAAAG CGAGCTGGTA TTAAAGTGAC GATTAAAACT
GTTGACGCAG TAAACTTTAC GGATGCAATT ACGTCAAATT CCGGAGATTA CGATTTGGCT
TTGGCTTCGT GGCAGCCTGA TTTTCCTTCT GCTTTTGCGA ATTTGGGACC GCTTTTTGAT
TCTTCGCAAA TTGGCGGCGG TAACTGGAAT ATTTCTCGTT ATTCGAATCC TAAAGTAGAT
GCTTTAATTC GCGAAGCTGT GCAAACTGTT GACGAGAATT CGGCTCATAA ACTGTGGCAA
AAGGCGGATC ACGCTATTAT GGAAGATTCT CCTGTTGTTC CGCTTATTTA CTCGCATAAT
ACGTTTATTC ATGGAAGCGG TGTTGAGAAT TTCTACATCG GAAGTTTTCC TGCTTACCCT
AATTATGCGG CTGTGTCTTT GAATAGGTGA
 
Protein sequence
MFIFKNNTAS KKILASTAAF SLLTISLLSL SACTNYNSGK TSITNTSVLN SNPKKGGTFT 
IFTSNTNMNF DPARSQGLPI TSNNFIFRAL TTWKVNPDLS KQTRVVPDLA TDTGTTSDGG
KTWKYTLKKG VKYEDGTEIT SHDIKFGIER SFADSLSGGF GYHKTLLVGA ENYRGPFDRK
SLDSIETPDN QTIIFHLKAP FADWPWVTSL AAFVPVPTSS GDAQTYSKRP KSSGPYRIVQ
NEVGKQVVMQ RNKYWDSKLD STRTASADEI VWKLGSDTTV SAQSMIQGNT DTKTAFLADF
VPPAQLAQAQ AHPQSRKLLT TSSDGALEYL AINTRRITDI NVRKAIQYAV DKQSYRTAKG
GEIAGGFATT LITPGISGRK QFNLYSEDPR GNVEKAKQLL KKSGKTDIKL ILIARPDQTQ
VASSVQSSLK RAGIKVTIKT VDAVNFTDAI TSNSGDYDLA LASWQPDFPS AFANLGPLFD
SSQIGGGNWN ISRYSNPKVD ALIREAVQTV DENSAHKLWQ KADHAIMEDS PVVPLIYSHN
TFIHGSGVEN FYIGSFPAYP NYAAVSLNR