Gene Information |
Locus tag | Veis_2011 |
Symbol | |
ID | 4692688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 2285200 |
End bp | 2286642 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639849777 |
Product | extracellular solute-binding protein |
Protein accession | YP_996781 |
Protein GI | 121608974 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
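The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2285200, 2286642)
print(length, protein_length(length))  # 1443 480
```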
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.427162 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAG CACTCAAAGC CCTCGCACGG GCATGCCACG ATCGGCGGCT GGAGCGTCGC GAGTTTCTGG CGCGCGCCAG CGCCCTCGGT TTTGGCAGCA GCGCCGCCGG CCTGATGCTC AATGCCGTGT CGACCCGGGC CCTCGCCCAG GACGGCGGCG TCGACTTCAT GAAGCACAAG GGCAAGACCG TCAAACTGCT GCTGAACAAG CATCCGTATG TGGATGCGAT GGTCAAGAAC ATCGAGAACT TCAAGGCCCT GACCGGCTTG AATGTCAGCT ACGACATCTT TCCGGAAGAT GTCTACTTCG ACAAGGTGAC CGCAGCCCTG GCCAGCAAGA GCAGCCAGTA CGACGCTTTC ATGACCGGCG CCTACCAGAC CTGGAAGTAC GGCCCGGCGC GCCAGATCGT CGACCTGAAC CAGTACCTGC AAGACCCCAA GCTCACCTCG GCCAACTACG CCTGGGAGGA TATCTACCAG AACCTGCGCG CCGCCACGTC CTGGGACGGC AAGGCCGGCT CCGCACTCGG CGGCCCGGGC GCCAAGCAAT GGGCCTTGCC CTGGGGCTTC GAGCTCAACA GCCTGGCCTA CAACAAGCGC CTGTTCGATG CGCTGAAACT GGGCGTGCCG ACCCACCTGG CGGACCTGGC GGACAAGGCC GCCAGCATCA GCAAGAGCGG CAAGGGCTAC GGCATCGGCG TGCGCGGCTC GCGCAGTTGG GCCACGATCC ATGCCGGCTT TTTGTCGGCG TACACCAACT TCGGCAACAA GGACTTCCAC AGCGCGGGCG GCAAGCTGAC GCCGGCAATG AACACGCCGC AGAGCAAGCA GTTTCACCAG CAGTGGATCG ACATGATCAA GAACGGCGGG CCGAAGAACT GGACCAACTA CACCTGGTAT GAGGTCGGCA ACGATCTGGG CGCGGGCAAT AGCGCGATGA TCTACGACGC CGACATCATG GGCTACTTCT TCAACAGCGG CAGCAACAAG GAAGCCGGCA ACCTGGCCTA CGCCGCGTTC ACGCCGAACC CGGCCGCCAA GGCGCCCACG CCCAATATCT GGATCTGGTC GCTGGCGATG AGCGAGTTCT CCAAACAAAA GGAGGCCGCC TGGTTCCTGC TGCAATGGGC CACCGGCACG CAGAACACCA CCTTCGGCGC CACCCAGGGC GACTTGGTGA ACCCGGTGCG CAAATCGGTC TGGGAAAACG CCCAGTTCAA GGAGCGGCTG GACAAGTCCT ACCCCGGCTA CCTGCGGCAA TACCAGGCCA GCGTGGAGGG CGCGAAGATC TACTTCACGC CGCAGCAGTT GTTTCCCGAA TTCACCACCG AGTGGGCGTC GATGCTGCAA CAGATGTACG GCGGCACGGT GCCGGTCGGC GAGGGGCTGG ACAAACTGGC CGAGACGCTG ACCCGCAAGC TCAAGGGCGT GGGCCTGGCC TGA
|
Protein sequence | MSEALKALAR ACHDRRLERR EFLARASALG FGSSAAGLML NAVSTRALAQ DGGVDFMKHK GKTVKLLLNK HPYVDAMVKN IENFKALTGL NVSYDIFPED VYFDKVTAAL ASKSSQYDAF MTGAYQTWKY GPARQIVDLN QYLQDPKLTS ANYAWEDIYQ NLRAATSWDG KAGSALGGPG AKQWALPWGF ELNSLAYNKR LFDALKLGVP THLADLADKA ASISKSGKGY GIGVRGSRSW ATIHAGFLSA YTNFGNKDFH SAGGKLTPAM NTPQSKQFHQ QWIDMIKNGG PKNWTNYTWY EVGNDLGAGN SAMIYDADIM GYFFNSGSNK EAGNLAYAAF TPNPAAKAPT PNIWIWSLAM SEFSKQKEAA WFLLQWATGT QNTTFGATQG DLVNPVRKSV WENAQFKERL DKSYPGYLRQ YQASVEGAKI YFTPQQLFPE FTTEWASMLQ QMYGGTVPVG EGLDKLAETL TRKLKGVGLA
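The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. That difference does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in this record) is ignored,
    and a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGTCTGAAG CACTCAAAGC"))  # MSEALK
```

Passing the full gene sequence string to `translate` applies the same steps to the whole record. The final codon, TGA, is a stop and contributes no residue.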