Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_4734 |
Symbol | |
ID | 4691379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 5223936 |
End bp | 5225501 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639852477 |
Product | extracellular solute-binding protein |
Protein accession | YP_999447 |
Protein GI | 121611640 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.676308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAA ATCTCATGAC GAAGGTCGCC CATCGGTTCT GGCTGAGGAT GCTGCCGCAG GGCATGGCGT GCTGCGCCTT GCTGCCGGCC TTCGTGCCGG CGCCTGCGTT GGCGGGCAAG GCCAATGACA CGCTGGTGTA TGCCTCTGAC AGCGAGGCGC CCAACATCAG CCAGTACCAC AATAACGTGC GCGAAGGCGT GATTCTGGCG CACCTGATCT GGGATACGCT GGTCTGGCGC GACCCCCGCA CCGGCGACTA CAAGCCGCAG TTGGCCAGCG CCTGGAAGTG GGAGTCGCCG ACGGCGCTGG TGATGGACTT GCGCCAGGGT GTGCAATTCC AGAATGGCGA TCCGTTCACG GCGGAGGATG TTGCCTTCAC CTTCAACTAT GCGGTGTCGA GCGAATCCAG GGTCATCACA CGGCAAAACG TCGATTGGAT CAAGAGCGTC GACAAACTCG GCGACTACAA GGTGCGCATC AACCTCAAGC AGCCCTTTCC GGCTGCGCTG GAATACCTGG CCGGTCCCTT GCCGATTTAT CCCGGCGCGT ATTTCAGGAA AGTCGGCCTG GAAGGATTCG CCAAGGCGCC GATCGGCACC GGCCCTTACA AGGTCGTCAG CGTGACGCCG GGGCGAGGCG TGAGCATGGT CAGGAACGGC AATTATTTCA AGGACAGTCC GCAAGGCCAG CCGAAGATTG GCAATATCAA ATTTGTCGTC ATTCCCGACC CTGAAACACG CTCGGCACAA CTGATGACCG GCGCGATCGA CTGGATTTGG CGCGTGCCCG CCGATCAGGC CGAGTCGCTC AAGAGCACGC CCGGGATCAC CGTGCAAAGC GGCGAAACCA TGCGCGTCGG TTTTCTGGTG ATCGATGCGG CGGGCAACTC CTCGCCCCAT TCGCCGTTCA AGGATGTGCG TGTGCGCCAG GCGGTCAACC ATGCGATCAA CCGCCAGGGT ATCGCCGACA ATCTGGTGCG CGGCGGCAGC AAGCCGGTCT ACACCGCCTG TTTCCGCACC CAGTTCGGTT GCGATGACAA GGTGGTGGTC CACTATGACT ACAACCCCGC CAAGGCGAAA GAGTTATTGC GCGCCGCTGG TTACGCCAAC GGTTTCGACA CCGATTTGTA TGCCTATCGC GAGCGCGAGT TCGCCGAAGC CATCGTCGGC GATTTGCACA AGGTGGGCAT TCGCGCACGG CTGCATTACA TGAAGCACGA TGCGATGCAG GTCGAGTACC GCGGCGGCAA GGCGCCGATG ACGTTTTATG CCTGGGGCTC GTACTCGATC AATGACACGT CCGCCTTTAC CGGCGTGTAT TTCAAGGGCA GCAGCGACGA CATCGTCAAG GACCCGCAAC TGCGCCAATG GCTGGAAACG GCCGACACCT CGACCGACCC TGCCGTGCGC AAAACGAATT ACGCCAAGGC ATTGGCGTTG ATTTCCCGGC AGGCTTATCT GGCGCCGATG TTTTCTTATT CCACTTACTA CGCCCATAGC TCGGCGCTCA GGTTTCAGGG ATATCCCGAC GAGTTGCCGC GTTTTCATGA GGCCAGTTGG AAGTAG
|
Protein sequence | MNANLMTKVA HRFWLRMLPQ GMACCALLPA FVPAPALAGK ANDTLVYASD SEAPNISQYH NNVREGVILA HLIWDTLVWR DPRTGDYKPQ LASAWKWESP TALVMDLRQG VQFQNGDPFT AEDVAFTFNY AVSSESRVIT RQNVDWIKSV DKLGDYKVRI NLKQPFPAAL EYLAGPLPIY PGAYFRKVGL EGFAKAPIGT GPYKVVSVTP GRGVSMVRNG NYFKDSPQGQ PKIGNIKFVV IPDPETRSAQ LMTGAIDWIW RVPADQAESL KSTPGITVQS GETMRVGFLV IDAAGNSSPH SPFKDVRVRQ AVNHAINRQG IADNLVRGGS KPVYTACFRT QFGCDDKVVV HYDYNPAKAK ELLRAAGYAN GFDTDLYAYR EREFAEAIVG DLHKVGIRAR LHYMKHDAMQ VEYRGGKAPM TFYAWGSYSI NDTSAFTGVY FKGSSDDIVK DPQLRQWLET ADTSTDPAVR KTNYAKALAL ISRQAYLAPM FSYSTYYAHS SALRFQGYPD ELPRFHEASW K
|
| |