Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_3852 |
Symbol | |
ID | 4694391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 4249163 |
End bp | 4250728 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639851601 |
Product | extracellular solute-binding protein |
Protein accession | YP_998579 |
Protein GI | 121610772 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0773863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.697886 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGACT TGCAATCACC GATCCGGTGC TGGAGCCTGA CCCTGCGCGG CGCGGTCGCC ACGGCAGTCC TGGCCGGTCT GGCGGCCACT ACCTTGGCCC GGGCACAGAC GGAAACGCCC AAACGCGGCG GTACCTTGAA CATCGGCGTT GAGGCCGATC CGGCGTCGCT TGACCCCTTG CGGCTGGGCT CCTATGTCGA GCGTCAGTAT GCGATGGCGG TTTTCGACAC GCTGCTAGAC ATTGACGCCA GCGGCAAGCT GGTGCCCAGC TTGGCTACGG CCTACGAGAT TTCGCCGGAT GGGAGGGCCT ACACCTTGAA ACTGCGCCAG GGTGTGACTT TCCACGATGG CACGCCCTTC GATGCGGATG CCGTGGTCTA CAACCTGGAC CGTGTGCGTG ATCCGGTCAA CAACTGCCGA TGCCTGGCCA ACATCAGCAG TGTCGAATCG GTCAAAGCCG TTGATACGCA TACGGTCGTG GTCCGGCTGC AAGCGCCAAG TGTTGCTTTC GCGGCATTGC TGGCGGATAC GGCTGCCATG ATGGCCTCGC CGAAAGCCCT CAAGGCCGAT CCGGTGGGGT TCGGCAATGT CCCGGTTGGC ACCGGGGCAT TCAAGCTGGT TGAATGGGTC AAGGGCTCAC GCTTTGTGGG TGAGCGCAAT CCCAACTACT GGCGGCAGGG ACAGCCCTAT GCGGATCGGC TGATCTACCG TGGGCTGCAA AGCAACGAGA CGCGCGAATC CACGTTCCAG TCCGGAGCAC TCGACATCCT GACCCAGGCT TCGCCCAAGT TCGTCGCTGC AGCTAAAAAG GACAGGCGAT TCAAGGTCCT GGAGCCTGAT GGTTTCGGCT CGATATTCAT CGCCTTGCGG ACGAAGCATC CGGCGCTGGC GGATATGCGC GTGCGCAAGG CGATCGCTCA CGCAACGCAA CGCGAGTTGC TCGTGAAGGC TGTGTACCAG GGTATGTACA AGGTCGCTAC CACACCCTTT GGTGAGGGGC TGCCAGGCCT GTCGCCAGTC ACGGACTACC CAGCCTATGA CCTGGACAAG GCGAAGGCAC TGCTGGCCGA ATACGGGCAA GCGGTGGAAC TGCGCCTGAT GATGGACAAC ACCCCTATAG CGTTGCAGGC CGCGCAAGCG CTGCAGCAGA TGTGGCAGCG TGCGGGCATC AAGGTGACCC TTACGCCAGT CGATCAGGCG CGGCTGGTGC AAAACATGCT CACCCACGAA TTCGATACGA CGCTGTTTCG CTGGTCTGGC CGTCCCGACC CCGATCTGAA CGCCTACACC TTCTTCCATT CCAGCAATGC TGAGAAGAAG ATCTCGTCCA ACTATATCCA ATACGCGAAT CCCGAAATGG ACCGTTTGCT CGACGCGGGC CGCATGGAGA TGGATCCGAG CAAACGCAAT CAAATCTATA ACCAGATTTC ACAACTGCTG GCCAAGGATT TGCCATACGT GTTCCTGGCC TACATCACCG CTCCCATCGT CACGACGCAG GCCGTGCGTG GGGTAGAACT GGTACCAGAC TCACTGATTC GCGTAGGCGC GGTCTGGAAA GAGTAA
|
Protein sequence | MFDLQSPIRC WSLTLRGAVA TAVLAGLAAT TLARAQTETP KRGGTLNIGV EADPASLDPL RLGSYVERQY AMAVFDTLLD IDASGKLVPS LATAYEISPD GRAYTLKLRQ GVTFHDGTPF DADAVVYNLD RVRDPVNNCR CLANISSVES VKAVDTHTVV VRLQAPSVAF AALLADTAAM MASPKALKAD PVGFGNVPVG TGAFKLVEWV KGSRFVGERN PNYWRQGQPY ADRLIYRGLQ SNETRESTFQ SGALDILTQA SPKFVAAAKK DRRFKVLEPD GFGSIFIALR TKHPALADMR VRKAIAHATQ RELLVKAVYQ GMYKVATTPF GEGLPGLSPV TDYPAYDLDK AKALLAEYGQ AVELRLMMDN TPIALQAAQA LQQMWQRAGI KVTLTPVDQA RLVQNMLTHE FDTTLFRWSG RPDPDLNAYT FFHSSNAEKK ISSNYIQYAN PEMDRLLDAG RMEMDPSKRN QIYNQISQLL AKDLPYVFLA YITAPIVTTQ AVRGVELVPD SLIRVGAVWK E
|
| |