Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_3814 |
Symbol | |
ID | 4690899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 4208885 |
End bp | 4210837 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639851563 |
Product | extracellular solute-binding protein |
Protein accession | YP_998541 |
Protein GI | 121610734 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0812118 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGACA TTGGCGTTGC CGGCCCCGGC AGGGGCCCCT GCGCGCCGGT CGTTGGCCAC GGTTGCCTGG CGCGCGGGGA TACGAAACCG TCTGCGGCCG AAGACAAAAC CGCCTGTGGC TTAGGGCAGA ATTGTCGCAT GCGGTTTTGG CTGATTTTCT TGCTGATGGT GTGCGCTGCT CCGGCCCGGG CCGCGCATGG CTATGCGCTG TGGGGCGACC TGAAATACCC GCCCGGGTTT GCCCATTTCG ACTATGTGAA CCCGGCGGCC CCCAAGGGCG GCGAACTGCG CATGGTGAGC AACCTGCGCT ACTCCACCTT CGACAAGTAC AACCCGTTCA CGATGAAAGG CGCGGCGCCG GCCTACCTGG GCGATCTGCT GTTCGAGACC CTGCTCACCG GCTCGATGGA CGAGACCGCC TCGGCCTACG GCCTGCTGGC CGAGAGCGTG GAGGTCGCGG CAGACCGGAT GAGCGCCACT TTCCGGCTGC GCGCCGAGGC GCGCTTTCAC AACGGCGAGC CGGTGCAGGG CGCCGATGTC AAGCACAGCT ACGAGACCCT GACCGGCCCC CATGCCGCCC CCGGCTACGC CAGCATGCTG CAAGAGGTGG CGGGCCTGGA GGTGCTGGAT GCGCGCACGG TGCGCTTTCG CTTCAGGCAG CCGCAACGCG AGTTGCCGCT GACCGTGGGC GGCATGCCGA TTTTCAGCCG CGCCTGGGGC CGGCAGGCGG ATGGCCGGGC CAAGCGCTTC GACGAGATCG TCACCGACAC CCCGATCGGC AGCGGCCCGT ACCGCATCGG GCCGGTGGCA TTCGGCCGCG ACATCACCTA TGTGCGCGAC CCGCAATACT GGGGCCGGCA CCTGAACGTG AACCAGGGCG CTTACAACTT TGAGCGCATC ACCATCAATA TCTACAAGGA CAATACCGCG CGGCTGGAGG CGATGAAGGC CGGCGAGTTC GACTTCATGA CCGTCTATTC GGCCGGCGAC TGGGCGCGCC GGATCAACGG CAAGCTCTTC GACCAGGGCG TGCTGGCCAA AGCGGAAATG CCGCACCAAC TGCCTGCCGG GTTCCAGAGC TATGTGCTCA ACACACGCCG CCCGCTGTTG CAGGATGTGC GCGTGCGCCA GGCGCTGGAC CTGGCCCTGG ACTATGAGTG GATGAACCGC CGGATGTTCT ACGGCGGCTA CGCCCGGGTC CAGGACCTGT TCGGCAACAC CCGCTGCGCC GCCAGCGGCA GCCCCGGCCC CGAGGAACTG GCCTGGCTGA CGCCCTGGCG CGGCCAAGTG CCCGAGGCCG TGTTCGGCCC CATGTACACC CCGCCGATGA CCGAAGACGG GGGCCAGGGC CATTCGCTGC GCCAGAACCT GCGCCTGGCG CGCCAACTGC TGGCCGATGC GGGCTGGACC TACCGCGCCG GCGCGCTGCG CAACGCCCGC GACGAGCCGC TGGTGCTCGA ATACATGGAC AGCAAGGAGG TCGGCGTTCG CACCGTCGCA TCGTGGATGC GCAACCTGGA AAAGCTCGGC ATCAGGCTGC GCTTCGTGTC GGTGGACTTC GCCCTGTACC AGCAGCGCCT CGACAAGTTC GACTACGACA TCATCACCCT GAATTTTCCC GGCACCTACA ACCCGGGCCA GGCGATGCAG GAGTTGTTCG GCAGCCAAAG GGTGGATGTG GAAGGCGCCG CCAACTATGC CGGCGTGCGC AGCCCGGCGG TGGATGCGCT GGTCACGGCG CTCACCCGCG CGCGCAGCTT GGCCGAGTTG CTGCCCGTCT GCCGCGCGCT CGACCGCGTC ATCATGCACA GCCACTACCT GATCCCGCAA TGGCGCCTGG CCTCGCACCG CATCGTGTAC AACCAAAAGC GCCTGGCCTA CCAGCGGCCG ATGCCCCCCT ATGCCGATGC GCAGGCTTGG CTGATGTTCG CGTGGTGGAG CATCGGGTCG TGA
|
Protein sequence | MRDIGVAGPG RGPCAPVVGH GCLARGDTKP SAAEDKTACG LGQNCRMRFW LIFLLMVCAA PARAAHGYAL WGDLKYPPGF AHFDYVNPAA PKGGELRMVS NLRYSTFDKY NPFTMKGAAP AYLGDLLFET LLTGSMDETA SAYGLLAESV EVAADRMSAT FRLRAEARFH NGEPVQGADV KHSYETLTGP HAAPGYASML QEVAGLEVLD ARTVRFRFRQ PQRELPLTVG GMPIFSRAWG RQADGRAKRF DEIVTDTPIG SGPYRIGPVA FGRDITYVRD PQYWGRHLNV NQGAYNFERI TINIYKDNTA RLEAMKAGEF DFMTVYSAGD WARRINGKLF DQGVLAKAEM PHQLPAGFQS YVLNTRRPLL QDVRVRQALD LALDYEWMNR RMFYGGYARV QDLFGNTRCA ASGSPGPEEL AWLTPWRGQV PEAVFGPMYT PPMTEDGGQG HSLRQNLRLA RQLLADAGWT YRAGALRNAR DEPLVLEYMD SKEVGVRTVA SWMRNLEKLG IRLRFVSVDF ALYQQRLDKF DYDIITLNFP GTYNPGQAMQ ELFGSQRVDV EGAANYAGVR SPAVDALVTA LTRARSLAEL LPVCRALDRV IMHSHYLIPQ WRLASHRIVY NQKRLAYQRP MPPYADAQAW LMFAWWSIGS
|
| |