Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_0531 |
Symbol | |
ID | 4692777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 596619 |
End bp | 598499 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639848309 |
Product | extracellular solute-binding protein |
Protein accession | YP_995333 |
Protein GI | 121607526 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.716167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAC TCATTCACGC GAGCATCGCG CTGTGCATTG CATGCGCAGC CAACACCCCG GCGCTGGCCG AAAACATGAC CGATGTCGGA ACGCCGCGCA AGGAAACCCT GGTGGTCGAC ATCCTCACGG GCCGGGTTGG CAACCCCAAG CGCATGAACC CCTATCTCGA AGGCAACAGC CTGGTTCAGG GATTGCACCA GCTCGGCTAC AGCAACCTGT GGGAGATAGA TACCGTCAAG GGCGTCCAGT ACCCGGCGCT GGCCGCCACC ATGCCCGAGG CCATCGATGG CAAGAACACC CGCTGGCGCT TCAAGGTGCG CAAGGGGCTG GCGTGGAGCG ACGGCGTGCC ATTCAGCGCC GCCGACGTGG CCTATACCGC GCAGATGATC ATCGGCAACC CCAAGCTGCC CTACAACCGC TTCCTGGAAA AGAACATCAA GAGCATCCAG GCGATCGACG CAGAAACGGT GGAACTGGAG ACGCTCAGTG CGATGCCGAA AATCGCCTAC ATGTTCGGTT CCGTGATCTT CGGCAACGGC TTTCGCGTGC TGCCCAAGCA TGTCTGGGAA AAGGTCGATC CGGCGACGTT CGCCAACTTC CCGCCGGTGA CCATCGGCCC GTACAAGCTC AAGGAGGTGG ACGACAACGG TTTTTGGTTC CTGTGGGAGA AGCGGGCCGA CTGGCAAAAG ACCGACGCCG GACAAATCGT CGGCGAGCCC AAGCCAAAGT ACGTCCTGTT TCGCTCCTAC GGCACGGAAG AAAAGCGGAT CATGGCAATG GCCCGGAACG ACATCGATGT GCTGACCGAC ATCACGCCCG AAGGCTTGGA CATCCTGCGG CAAAGGAATG CCAAGGTGAA GTCCTGGTAC GACGCCTTCC CCTGGGCCGC CCTGGACGAC CCCTGCGAGC GGGGCATCTC GTTCAACACC TCGAGCGCCC CCTACGACCA ATGGCAGGTG CGCTGGGCGC TGGCGCTGGC GACCCGGATC GACAACGTCA GCATCGCGAC ATTCTCCGGG ATGATGCGCG CCTCGCCGCT GCATGCGCCG CCGATCTCGA TCCTGATGAA AACCTACCAC GAGCCGATGC TGCCATGGCT CAAGGGCTTT GCGCTGCCGG ACGGCTACAA GCCTTTCGAT TCCGAATTCG CCATCCGGAT GGGCAAGCGC CTGGCCGCCG AAAAGATACC GGGCCTGCCG ACGGGCGACG CCGAACTACG CAAGCTGTTT GGCGTCGGCT GGTGGAAGTT CGACCCTGCG AAGGCCGCCG CACTGCTGCA AAGCGTGGGC TTTCGCAAGG CCGACGCCGG ATGGCTGCTG CCCGATGGCA AGCCCTGGAA GATGACGATC AACGCGCCGG CCGACTTCGA GATCCAGTCC CAGCGCCTGG CCTTCGCCGT GGCCAATGAA TGGAAGAAAT TCGGCATCGA TGTCAATGTG CAGCAGCAGC AAGGCGGCGT TTTCACCACC GAGTACGCCT CCGGCAACTT CCAGGCCGGC GCCTACTGGA ACCAGACCTG CGCGATCGGC CCCGATCTGT GGGTTCGACT GGAATGGTGG CACGAAAAGT ACGTCAGCCC CAACGGTCAA CCGGCCGCGT TCAACCGCGA ACGCTACACG AACCCGAGCC TGAGCCACAC CATCGACCAG ATGGCGATTT TGCAGCCGAC AGACCCGAAG AATGTGGAAC TGGGCACGGC GCTGTTGAAG GAACTGGTGG CCGGCATGCC CGTCATCCCG ATGTTCGGCA CGTCCAAGTT CGTGCCGGTC AACACCACGT ACTGGAGCAA CTTTCCTTCT GCCAGCAACT ACTATGAAGG GCCTTGGTGG TGGTGGTCGA ACTTCAAGTA CATCGTCGCC AGACTCAGGC CGGCGCCATG A
|
Protein sequence | MKRLIHASIA LCIACAANTP ALAENMTDVG TPRKETLVVD ILTGRVGNPK RMNPYLEGNS LVQGLHQLGY SNLWEIDTVK GVQYPALAAT MPEAIDGKNT RWRFKVRKGL AWSDGVPFSA ADVAYTAQMI IGNPKLPYNR FLEKNIKSIQ AIDAETVELE TLSAMPKIAY MFGSVIFGNG FRVLPKHVWE KVDPATFANF PPVTIGPYKL KEVDDNGFWF LWEKRADWQK TDAGQIVGEP KPKYVLFRSY GTEEKRIMAM ARNDIDVLTD ITPEGLDILR QRNAKVKSWY DAFPWAALDD PCERGISFNT SSAPYDQWQV RWALALATRI DNVSIATFSG MMRASPLHAP PISILMKTYH EPMLPWLKGF ALPDGYKPFD SEFAIRMGKR LAAEKIPGLP TGDAELRKLF GVGWWKFDPA KAAALLQSVG FRKADAGWLL PDGKPWKMTI NAPADFEIQS QRLAFAVANE WKKFGIDVNV QQQQGGVFTT EYASGNFQAG AYWNQTCAIG PDLWVRLEWW HEKYVSPNGQ PAAFNRERYT NPSLSHTIDQ MAILQPTDPK NVELGTALLK ELVAGMPVIP MFGTSKFVPV NTTYWSNFPS ASNYYEGPWW WWSNFKYIVA RLRPAP
|
| |