Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_4737 |
Symbol | |
ID | 4694070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 5227805 |
End bp | 5228695 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639852480 |
Product | extracellular solute-binding protein |
Protein accession | YP_999450 |
Protein GI | 121611643 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.765927 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCT CCGTCCTGAT GGCATCCATG GCTGCTGCGG TGCTCTTCAG CGCCACCGGC GCGCAAGCCG CCGACACGCT CGCCAAGATC GCCGAGTCCG GCAAGATCAC GCTCGCCTAC CGCGAGTCGT CGGTGCCCTT CAGCTACCTG GACGGCGCCA ACAAGCCGAT CGGGTTCTCG GTGGAACTCT CCAACGCCGT GGTCGAAGCG GTGAAGAAGA AGTTGAACAA ACCCAACCTG CAGGTCGCGC TGATGCCCGT CACTTCGCAG AACCGCATTC CGCTGCTCAC CAACGGCACC GTCGATCTGG AGTGCGGCTC CACCACCAAC AACAGCGCGC GCGGCAAGGA CGTGGCCTTC GCGGTCAATC ACTTCTATAC CGGCACGCGG CTCTTGGCGA AGAAGTCCTC CAAGATCAAG GACTACGCCG ACCTCGCCAA GAAGACCGTG GCCAGCACCA CCGGCACCAC CAACGCGCAG GTCATGCGCA AGTACAACGC CGACAAGAAT CTTGGCATGG ACATCGTGCT CGGCAAGGAC CACGCCGACG CCTTCCTGCT GGTCGAGAGC GACCGCGTCG TCGCCTTCGC GATGGACGAC ATCCTGCTGT TCGGCCTGAT CGCCAACTCG AAAAACCCGG CCGACTACGA AGTGGTGGGC GAGTCGCTGC AGGTCGAGCC CTACGCCTGC ATGCTGCCCA AGGACGACCC GGCCTTCAAG AAACTGGTGG ACGACACCTT CATCGACATG ATGAAAAGCG GCGAGTTCGA GAAGCTCTAC GCCAAGTGGT TCATGCAGCC GATCCCGCCG AAGAACGTGC CGCTGAACCT GCCGATGAAC GATCAGTTGA AAGAGAACCT CAAGGCCTTC AGCGACAAGC CGGCGACCTG A
|
Protein sequence | MKRSVLMASM AAAVLFSATG AQAADTLAKI AESGKITLAY RESSVPFSYL DGANKPIGFS VELSNAVVEA VKKKLNKPNL QVALMPVTSQ NRIPLLTNGT VDLECGSTTN NSARGKDVAF AVNHFYTGTR LLAKKSSKIK DYADLAKKTV ASTTGTTNAQ VMRKYNADKN LGMDIVLGKD HADAFLLVES DRVVAFAMDD ILLFGLIANS KNPADYEVVG ESLQVEPYAC MLPKDDPAFK KLVDDTFIDM MKSGEFEKLY AKWFMQPIPP KNVPLNLPMN DQLKENLKAF SDKPAT
|
| |