Gene Information |
Locus tag | Veis_4964 |
Symbol | |
ID | 4691902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 5493245 |
End bp | 5494960 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639852700 |
Product | extracellular solute-binding protein |
Protein accession | YP_999669 |
Protein GI | 121611862 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
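The coordinate and length fields above are tied together by simple arithmetic. The following is a minimal Python sketch, assuming the Start bp and End bp values are inclusive, 1-based coordinates and that the CDS is complete and ends in a stop codon. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count of a complete, unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(5493245, 5494960)
print(length)                  # 1716
print(protein_length(length))  # 571
```

Both results agree with the Gene Length and Protein Length rows of the table.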
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000193242 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACGA ACAGGAACTT GCTCGCCCGT CTGGCCCCGC TGGCTGCCGG CATTGCCATG GCGGCTGCAG CCGGCACCGC GCAGGCCCAG GCCCACGCCG AGACGCCCAA GTACGGCGGC TCGCTGGATG TGGCCACCAC GAGCACGGGC GTCGCGACCC TGTCCTGGGA CCTGGCCGAC TGGCAGGCCA CACTGCAGAC GCGTGACACG GGCCAGTTCT ACGAGCAGCT GTTCTCGGTC GATCTCTCGA AGGCCAAGAG CCGTGGCGGC AAGTACCCGT TCACGATCAG CGGCTGGCAG CCCACCGACG CCGTGCGCGG CGAACTGGCC GAAAGCTGGA ATTGGGTCAC GCCGCTGGTG CTGGAGGTCA AACTGCGCCA GGGCGTGAAG TTTCCGGCCA AGCAGGGCGT GATGGCCGAG CGGGAGTTCG TGGCCGACGA CGTGGTCTAC AGCTTCAACC GGCTCAACAA CAGCCCGAAG AAGACCCAAG GCTACTACGA CCACCTGGAC AAGGTCGAGG CCAAGGACAG GCACACGCTC GTCTTCACGT TCAAGCAGTT CCTGGCCGAC TGGGACTACC GCTTCGGCAA CGGCTTCTTC TCCGGCATCA TGCCCAGGGA GGTCACCGAG GCCGGCGGCG GCAACTGGAA GAACGCCAAC GGCACGGGCC CCTTCATGCT CACGAACGTG GCGCAGGGTA ATTCGCTGAC CTTCACCAGG AACCCGATCT ACTGGGATCA GGAAGTCATC GGCGGCAAGC CCTACAAACT GCCCTTCGTC GACAAGCTCA CGCACCGCGT GATCAAGGAC GAGGCCACGC AGCAGGCCGC GCTGCGCACC GGCAAGCTCG ACATCCTGTC CTCGATCTCC TGGGAGGCCG CGCGCGAGCT GAAAAAGAGC ACGCCGCAGT TGCAGTGGAA CCGCTGGCTG TCCATGGCCG CCGCGCGCGT GGCGCTGCGC GTGGACACCA AGCCCTTCAC CGACGTGCGC GTGCGCCGCG CGCTGAACAT GGCGGTCAAC AAGCAGGAAA TCATCGACAA GTTCTACGGC GGCGAAGGCG AGATGTTCGT GTTCCCGATG AACCCGGCAT ACGTGGGCTA CCACACGCCG CTGAAGGACA TGCCGGCCTC GGTGCAGGAA CTGTTCAAGT ACGACCCGGC CAAGGCCAGG AAACTGCTGG CCGAGGCCGG CTACCCCAAG GGCTTCGCGT TCAAGATGCA GATCTGCACC TGCACCACCG AGCAGATGGA GCTGATGCCG CTGGTCGCGG CCTACCTGGA GATGGTGGGC GTGAGGATGG AGATCGTGCC CATGGAATAC GGCGCGCACC TGTCGGCCAT GACCAGCCAC GTCAACGGCG CGGGCTACCT CACGACCGTT TCCGACGTGA ACCCCACGAC CTCGCTGCGC ATCAACTTCG GCAAGGGCCA GGTCTACAAC GCGCCGATGT GGAACGACGC CAGGTTCGAC GCGCGTGTGG CCGATGCGCT GGCCGAGCGC GACGAGCCCA GGCGCCAGCA GATCCTGCGC GAGCTGACCG CGCAGATCGT CGAGCAGGCG CCCGCCATCT GGATGCCGGC CCCGTACCGC TACACGGCCT GGTGGCCCTG GGTCAAGAAC TACGGCGGCG AGCTGTTCGT CGGCGCCGGC CGCAGCGCGC CCATCCATGC GCGTGTCTGG ATCGACCAGG ACCTGAAGAA GACGCTCGGC TTCTGA
|
Protein sequence | MKTNRNLLAR LAPLAAGIAM AAAAGTAQAQ AHAETPKYGG SLDVATTSTG VATLSWDLAD WQATLQTRDT GQFYEQLFSV DLSKAKSRGG KYPFTISGWQ PTDAVRGELA ESWNWVTPLV LEVKLRQGVK FPAKQGVMAE REFVADDVVY SFNRLNNSPK KTQGYYDHLD KVEAKDRHTL VFTFKQFLAD WDYRFGNGFF SGIMPREVTE AGGGNWKNAN GTGPFMLTNV AQGNSLTFTR NPIYWDQEVI GGKPYKLPFV DKLTHRVIKD EATQQAALRT GKLDILSSIS WEAARELKKS TPQLQWNRWL SMAAARVALR VDTKPFTDVR VRRALNMAVN KQEIIDKFYG GEGEMFVFPM NPAYVGYHTP LKDMPASVQE LFKYDPAKAR KLLAEAGYPK GFAFKMQICT CTTEQMELMP LVAAYLEMVG VRMEIVPMEY GAHLSAMTSH VNGAGYLTTV SDVNPTTSLR INFGKGQVYN APMWNDARFD ARVADALAER DEPRRQQILR ELTAQIVEQA PAIWMPAPYR YTAWWPWVKN YGGELFVGAG RSAPIHARVW IDQDLKKTLG F
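The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the coding sequence. It rests on several assumptions: the helper names are hypothetical; the gene sequence is used as printed, which appears to be in coding orientation (it starts with ATG and ends with TGA) even though the gene lies on the minus strand; and the codon table is the standard one, whose amino acid assignments are shared with translation table 11 (the two differ in which codons may act as start codons). Ambiguity codes such as N are not handled.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = "ATGAAAACGA ACAGGAACTT GCTCGCCCGT"
print(translate(prefix))  # MKTNRNLLAR
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` should allow the GC content and Protein Length fields to be checked in the same way.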