Gene Information |
Locus tag | HMPREF0424_0438 |
Symbol | |
ID | 8709095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 479652 |
End bp | 481361 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 646482553 |
Product | extracellular solute-binding protein |
Protein accession | YP_003373683 |
Protein GI | 283782929 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
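The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates (an assumption that matches the reported 1710 bp) and an unspliced CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 479652
END_BP = 481361


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino acid count for an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1710
print(protein_length(length_bp))  # 569
```

Both results agree with the Gene Length (1710 bp) and Protein Length (569 aa) rows.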
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000302418 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTCATTT TCAAAAATAA TACGGCTTCA AAGAAAATTC TTGCTTCAAC AGCAGCTTTT TCACTACTTA CAATTTCACT ACTTTCGCTT TCAGCTTGCA CAAATTACAA CTCTGGGAAA ACCAGTATTA CGAACACTTC CGTGCTAAAT TCTAATCCTA AAAAAGGCGG AACTTTTACA ATATTTACGT CAAATACGAA TATGAATTTC GATCCAGCGC GCAGTCAGGG TTTGCCAATT ACGTCAAATA ATTTCATCTT CCGCGCGCTT ACTACTTGGA AAGTAAATCC GGATTTAAGC AAGCAAACGC GTGTAGTTCC AGATTTGGCA ACGGATACTG GCACAACAAG TGACGGCGGA AAAACTTGGA AATACACTCT TAAAAAAGGT GTTAAGTATG AAGATGGAAC CGAAATCACT TCGCACGATA TTAAATTTGG TATTGAGCGC TCGTTTGCAG ATTCGTTAAG CGGTGGTTTT GGATATCACA AAACTCTTCT TGTTGGAGCT GAAAACTATA GAGGACCATT CGACAGAAAG TCGCTTGATT CGATTGAAAC TCCAGATAAT CAAACGATTA TTTTCCATTT GAAAGCACCT TTTGCAGATT GGCCGTGGGT GACATCTTTG GCTGCATTTG TGCCAGTTCC AACTAGTTCA GGCGATGCTC AGACATATAG TAAAAGGCCG AAATCTTCAG GACCATACAG GATTGTGCAG AACGAAGTCG GAAAGCAAGT TGTAATGCAA CGCAATAAGT ATTGGGATTC CAAGCTCGAC AGTACTCGCA CAGCTTCTGC AGATGAAATT GTGTGGAAAC TTGGTTCAGA TACTACTGTT TCAGCTCAGT CTATGATTCA AGGAAATACG GATACAAAAA CCGCATTTTT GGCTGATTTT GTGCCGCCTG CTCAGCTTGC ACAAGCTCAA GCGCATCCTC AATCTCGCAA GTTGCTAACT ACTAGTAGTG ATGGTGCTCT TGAATATTTG GCTATTAATA CGCGCAGAAT AACGGATATC AATGTTCGAA AAGCGATTCA GTATGCTGTT GATAAGCAGT CGTATCGAAC TGCGAAGGGT GGCGAAATTG CTGGAGGTTT TGCTACTACG CTTATTACGC CTGGTATTTC TGGTCGAAAG CAGTTTAATT TGTATTCTGA AGATCCTCGA GGGAACGTTG AGAAAGCTAA ACAACTGTTG AAAAAATCAG GTAAAACCGA TATTAAGTTG ATTCTGATTG CTCGTCCGGA TCAAACGCAA GTGGCATCTT CTGTGCAATC AAGTTTAAAG CGAGCTGGTA TTAAAGTGAC GATTAAAACT GTTGACGCAG TAAACTTTAC GGATGCAATT ACGTCAAATT CCGGAGATTA CGATTTGGCT TTGGCTTCGT GGCAGCCTGA TTTTCCTTCT GCTTTTGCGA ATTTGGGACC GCTTTTTGAT TCTTCGCAAA TTGGCGGCGG TAACTGGAAT ATTTCTCGTT ATTCGAATCC TAAAGTAGAT GCTTTAATTC GCGAAGCTGT GCAAACTGTT GACGAGAATT CGGCTCATAA ACTGTGGCAA AAGGCGGATC ACGCTATTAT GGAAGATTCT CCTGTTGTTC CGCTTATTTA CTCGCATAAT ACGTTTATTC ATGGAAGCGG TGTTGAGAAT TTCTACATCG GAAGTTTTCC TGCTTACCCT AATTATGCGG CTGTGTCTTT GAATAGGTGA
Protein sequence | MFIFKNNTAS KKILASTAAF SLLTISLLSL SACTNYNSGK TSITNTSVLN SNPKKGGTFT IFTSNTNMNF DPARSQGLPI TSNNFIFRAL TTWKVNPDLS KQTRVVPDLA TDTGTTSDGG KTWKYTLKKG VKYEDGTEIT SHDIKFGIER SFADSLSGGF GYHKTLLVGA ENYRGPFDRK SLDSIETPDN QTIIFHLKAP FADWPWVTSL AAFVPVPTSS GDAQTYSKRP KSSGPYRIVQ NEVGKQVVMQ RNKYWDSKLD STRTASADEI VWKLGSDTTV SAQSMIQGNT DTKTAFLADF VPPAQLAQAQ AHPQSRKLLT TSSDGALEYL AINTRRITDI NVRKAIQYAV DKQSYRTAKG GEIAGGFATT LITPGISGRK QFNLYSEDPR GNVEKAKQLL KKSGKTDIKL ILIARPDQTQ VASSVQSSLK RAGIKVTIKT VDAVNFTDAI TSNSGDYDLA LASWQPDFPS AFANLGPLFD SSQIGGGNWN ISRYSNPKVD ALIREAVQTV DENSAHKLWQ KADHAIMEDS PVVPLIYSHN TFIHGSGVEN FYIGSFPAYP NYAAVSLNR
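The relationship between the two sequences can be checked by translating the gene sequence with translation table 11, as listed in the record. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI base ordering (TCAG), strips the 10-base grouping spaces, and stops at the first stop codon. It does not handle alternative start codons (such as GTG or TTG), which is harmless here because this gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
first_30 = "ATGTTCATTT TCAAAAATAA TACGGCTTCA"
print(translate(first_30))  # MFIFKNNTAS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the GC content row.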
| |