Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_3881 |
Symbol | |
ID | 4694340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 4275027 |
End bp | 4276574 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639851628 |
Product | extracellular solute-binding protein |
Protein accession | YP_998606 |
Protein GI | 121610799 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.441306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0413131 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAGTG CACTGCTCGC CGCGGCCTTG GCCCCGGCAC TGGCGCAGGA ACTCAGGATT GGCGTGCGCG CCGGCCCGGA AGCGATGGAC CCGCACTACC TGGCGCTAGG CACGCAAATC GCCGCGATAA AAAACATCTA CGAAGCCCTG GTGTTTCAGG ATGATAAACT GCAAATCCAG CCCGGCCTGG CGACCAGTTG GAAAGTGATC GACGAGCACA CCTGGGAATT CAAACTGCGC CCCGGTGTGA AATTTCACGA TGGCAGTGTG CTGACGGCGC AGGATGTCAA GTTTTCTCTG GAGCGCGTTC CCAAGGCGGC GGGGCCGGAT GCCGGCCTGG TCATCAATAC ACGCAATATC AGCAAGGTCG ATGCCGTGGA TGATTTGACC GTCCGTGTCA TAACCTCGAT GGCCAACCCC GCCTTGCCAC AAGACCTGGC GCGCATCATG ATCGTGCCCG CCTCCATCGG CGCGGCCAAG GTCGCCGATT TCAACAGCGG CAAGGCAGCC ATTGGCACCG GCCCCTTCAA ACTGATCTCG TTCAAACCGC GCGCCGATTT GTTGCTGGAG CGCTTCGACC AATACTGGCG CGGCGCGGCA GACTGGAAGA AAGTGCATTT TCTCGAAATC AGCAACGATG CTGCGCGCCT GGCCGCACTG TCATCCAAAC GGGTCGACCT GATCAACTAC CTGCCTTACG GCGATGTCGC CAAACTCAAA AACAATCGCG ATTTCTCCGT GGTGCAGGGC GACTCCATCT ATATCTACCT GCTTTACCCG GACGTGCGCG AAAAGAGCGA GTTAATCACC GACAAGGCGG GCAAACCACT GCCAGTCAAC CCGCTGCGCG ACAAGCGCGT GCGTGAAGCG CTATCGCTGG CCATCGACCG CAAGACCATC GCCGGCACCG TACTCGAAGG CATGGCCACG CCATCCAATC AACTGATCGT GGACGGCTTC TTTGGTGCCC TGTCCAAACC ACCGTTGTTG CCAACGAATA TCGACAGGGC CAAAAAATTG CTGACCGAAG CAGGCTACCC GAACGGCTTC TCGCTGCCCC TGCACTGCAC CAGCGACCGC TTGCCAGGAG ATGCGGCGAC CTGCTCGGCG CTAGGGCAGA TGTTTGCCAA AATCGGCATC GACACGAAAA TCAACGCCAT TTCGCGCACC GTGTTCATTC CGGCGCGCAA GCGTGGTGAA TACGTGCTAT CACTGGCCGG TTGGGGTTCA CTGACCGGAG AAGCGGGCTA TACCTTGGCA GCCATTGCCC GCACCAACGA CAAATCCAAA GGTTTCGGCG CATCCAACGT GACGAATTAT TCCAATGACG CTGCTGATGC AGCAATATCG GTCGCCATGC GCATGACCAA CGACGACAAG CGCCGGCTAT TGTTTGAAAG CGCAATGAAA ATGGTGCAGG ACGACTACGC CATCATTCCG GTCGTGCAGT TGTCCTCGGT ATGGGCCGCG CGCGCCAATA CCTTGTCTTT CAAGCCGCGT GTGGATGATG AGACGCTGCC GTTTTTCATC CGGCTGGACA AGAAGTAA
|
Protein sequence | MMSALLAAAL APALAQELRI GVRAGPEAMD PHYLALGTQI AAIKNIYEAL VFQDDKLQIQ PGLATSWKVI DEHTWEFKLR PGVKFHDGSV LTAQDVKFSL ERVPKAAGPD AGLVINTRNI SKVDAVDDLT VRVITSMANP ALPQDLARIM IVPASIGAAK VADFNSGKAA IGTGPFKLIS FKPRADLLLE RFDQYWRGAA DWKKVHFLEI SNDAARLAAL SSKRVDLINY LPYGDVAKLK NNRDFSVVQG DSIYIYLLYP DVREKSELIT DKAGKPLPVN PLRDKRVREA LSLAIDRKTI AGTVLEGMAT PSNQLIVDGF FGALSKPPLL PTNIDRAKKL LTEAGYPNGF SLPLHCTSDR LPGDAATCSA LGQMFAKIGI DTKINAISRT VFIPARKRGE YVLSLAGWGS LTGEAGYTLA AIARTNDKSK GFGASNVTNY SNDAADAAIS VAMRMTNDDK RRLLFESAMK MVQDDYAIIP VVQLSSVWAA RANTLSFKPR VDDETLPFFI RLDKK
|
| |