Gene Information |
Locus tag | Veis_4121 |
Symbol | |
ID | 4695052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 4520121 |
End bp | 4521686 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639851868 |
Product | extracellular solute-binding protein |
Protein accession | YP_998844 |
Protein GI | 121611037 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
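The coordinate and length fields above are linked by simple arithmetic, and a short check makes that relationship explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the 1566 bp length listed) and that the final codon of the unspliced CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4520121, 4521686)
print(length, protein_length(length))  # 1566 521
```

Both values match the Gene Length and Protein Length fields in the record.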
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.157439 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACTCA AGCTATCGAT GCTGTTCGCC GTCGTCTTGG CCGCGGCATC GGGCGGCACT TGCGTGCTGG CCAAAACCGC CAAAGACATA TTGGTGATCG GAAAATCGGC CGATCCGCAA AACCTGGATC CGGCCGTCAC GATGGACAAC AACGACTGGA CGGTCACATA CCCGGCGTAC CAGCGTCTGG TGCGCTACAA GGTCCATGAC GGCAAGGCTT CCAGCGAGTT GGAAGGCGAT TTGGCGCAAA GCTGGAGTAG TTCGCCAGAC GCGATGACCT GGGAATTCAA GCTCAAGCCG GGCAGCAAAT TCGCCGATGG TTCGCCGGTC GATGCCCATG CGGTGAAGTT CTCCTTCGAG CGTTTGCTGG CCTTGAAAAA AGGCCCTTCC GAACCCTTCC CCCCGGGCAT CGAGGTCAGC GCGCCCGACG CGTCGACCGT GCGCTTCAAG CTCAAGACCG GCTTTGCGCC TTTCCTGTCG ATCCTGGCCA TCGATGGCGC GTCGGTGGTC AACCCGAAAG TGATGCAATA CGAACAAAAC GGCGACAAGG CCCAGGGCTG GCTGGCGGGG CACACCATGG GCAGCGGCGC CTTCCAACTG AGCAGTTGGC AAAAGGGCCA AAGCATCGTG ATGGACAAAA GCCCGCATCC GAATGGAGCG GCGCCGGCCT TCAACAAAGT GATCATCAAG TTCGTGCCCG AGGCCTCGGC GCGCCGCCTG CAACTGCAAG GCGGCGACAT GGACATTGCC GAAGACCTGC CGCCCGACCA GATCGAAAGC CTGAAAGCGC AACAGGGCCG CCAGGGCGTC GTGGTGGGCG ACTACCCGAG CTTGCGCGTC ACCTACCTGT ACCTGAACAA CAAAAAAGCG CCGCTGGACA AGCCCGAGGT GCGCCGCGCC ATCATCGCCG CCGTCGATGT GCGCGCCATC ATCGACGGCA TTTTCTCGGG CAAGGCCAAG GCCATGAACG GGCCCATTCC CGAAGGCATG TGGGGGCACG ACGCGCAGGC TGCGCCCGCA GCCTTTGCGC CGGCCAAGGC CAGGGAACTG CTGGCCAAAG CCGGGCTGCG CAATATCCGG CTGGGCTTTT TGCTGTCGGA CAAAGACCCT TCGTGGAGCC CGATCGCGCT GGCCACGCAG TCCAACCTGG CCGATGTCGG CATCCAGGTG CGCCTGGAAA ACATGGCCAA TGCCAGCTTT CGCGAACGTG TCGGCAAGGG CGACTTCGAT ATCGCCATCG GCAACTGGAG CCCCGACTTT GCCGACCCCT ACATGTTCAT GAACTACTGG TTCGAGAGCG ACAAGCAGGG GGCCGCCGGC AACCGCTCCT TCTACTCCAA CCCCCGGGTC GATGCGCTGC TGGCCAGAGC GGCCCATGCG AGCGCCTTGT CCGAGCGCAG CAGGCTGTAC CAGGAGGCGC AAAAAATCGT GGTCGACGAT GCGGTCTATG TCTACCTGTT TCAGAAAAAC ACCCAGATCG CCGCGCGCAG CAGCGTCAAG GGGCTGGTGT TCAACCCGAT GCTCGAGCAG ATCTACAACG TCCAGCAGAT GTCCAAGTCC GAGTAG
|
Protein sequence | MKLKLSMLFA VVLAAASGGT CVLAKTAKDI LVIGKSADPQ NLDPAVTMDN NDWTVTYPAY QRLVRYKVHD GKASSELEGD LAQSWSSSPD AMTWEFKLKP GSKFADGSPV DAHAVKFSFE RLLALKKGPS EPFPPGIEVS APDASTVRFK LKTGFAPFLS ILAIDGASVV NPKVMQYEQN GDKAQGWLAG HTMGSGAFQL SSWQKGQSIV MDKSPHPNGA APAFNKVIIK FVPEASARRL QLQGGDMDIA EDLPPDQIES LKAQQGRQGV VVGDYPSLRV TYLYLNNKKA PLDKPEVRRA IIAAVDVRAI IDGIFSGKAK AMNGPIPEGM WGHDAQAAPA AFAPAKAREL LAKAGLRNIR LGFLLSDKDP SWSPIALATQ SNLADVGIQV RLENMANASF RERVGKGDFD IAIGNWSPDF ADPYMFMNYW FESDKQGAAG NRSFYSNPRV DALLARAAHA SALSERSRLY QEAQKIVVDD AVYVYLFQKN TQIAARSSVK GLVFNPMLEQ IYNVQQMSKS E
|
| |
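The Translation table and GC content fields can be related to the sequences above with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is listed in coding orientation (it begins with GTG and ends with TAG), builds the codon table from the standard NCBI amino-acid ordering used by translation table 11, and treats an alternative start codon in first position as methionine. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for translation table 11, in NCBI order (T, C, A, G at each position).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by table 11; each is read as M when it opens the CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter groups and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 24 bases of the gene sequence as listed in the record.
print(translate("GTGAAACTCA AGCTATCGAT GCTG"))  # MKLKLSML
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence field, and `gc_content` on the same input can be compared with the listed 62%.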