Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mrad2831_4579 |
Symbol | |
ID | 6140646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium radiotolerans JCM 2831 |
Kingdom | Bacteria |
Replicon accession | NC_010505 |
Strand | + |
Start bp | 4874295 |
End bp | 4876085 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641630294 |
Product | extracellular solute-binding protein |
Protein accession | YP_001757228 |
Protein GI | 170750968 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGAGC GGGATCTGCG CGGCCTCATC GGGCGGGTGA AGGGCGGTGG CCTGTCGCGG CGCGCCTTCG TGCAGCGGAT GGTGGCCCTG GGCCTCACCG CGCCCATGGC CGGGCTGATG CTGGCCGGGA ACGGTGTCGC GATGGCGGCC GATATCCGGT CCGGCTACAA GCCGACCAAG GCCGGCGGCG GCGGCGCGCT CAAGCTGCTC TGGTGGCAGG CGCCGACGCT GATCAACCCG CATTTCGCCA TCGGCACCAA GGACCAGGAC GCCTCGCGGA TCTTCTACGA GCCGCTCGCC GCCTGGGACG CGGACGGCAA CCTCGTGCCG GTTCTGGCCG CCGCGATCCC GTCCAAGGAG AACGGGGCGC TCGCCGCCGA CGGCCGCTCG GTGGTCTGGA CGCTCAAGCC CGGCGTGAAG TGGCACGACG GCAAGCCGCT CACCGCCGAC GACCTCGTCT TCACCTGGGA GTACGCCCGC GACCCCGCCA CCGCGGCGGT GACGGCCGGG TCCTACAAGG ATTGCAAGGT CGAGAAGGTC GACGACCTCA GCGTCCGGGT GCTGTTCGAC AAGCCGACGC CCTACTGGTG CGACGCCTTC GTGGGCATCG TCGGGATGGT GCTGCCGAAG CACCTGTTCG GCCCGTACAG CGGCGCCAAG TCCCGGGACG CGCCGCAGAA CCTCGCCCCC GTGGGCACCG GCCCGTACCG GTTTGTGGAG TTCCGGCCCG GCGACATCGT CCGCGGGGAG CGCAACCCGG ATTACCACCT GCCGAACCGG CCGTATTTCG ACACGATCGA GATGAAGGGC GGCGGGGACG CGGTCTCGGC CGCCCGCGCG GTGCTCCAGA CCGGCGAGTA CGACTACGCC TGGAACATGC TGATCGAGGA CGAGGTGCTC AAGCGCCTGG AGACCGGCGG CAAGGGCCGG GTCGACGTGG TCTACGGCGG CAAGCTCGAG TTCCTGCTCC TCAACGCCAC CGACCCGAAC GTCGAGGTCG ACGGCGAGCG CGCCTCGATC ACGACGAAGC ACCCCGCCTT CTCCGACCCG AAGGTGTGCC AGGCGATGAA CCTGCTGGTC GACCGCAAGT CGATCCAGAC CTACATCTAC GGCCGCACCG GCAAGCCCAC CGCCAACACG GTCAACGGCC CGGAGCGCTT CGTCTCCAAG AACACGAGCT TCGCCTTCGA CCCGGCCAAG GCCAACGCGC TCCTCGACGA GGCCGGCTGG AAGAAGGGCT CCGACGGCAT CCGCGCCAAG GACGGCAAGA AGCTGAAGCT CGTCTTCCAG ACCTCGATCA ACGCGCCGCG CCAGAAGACC CAGGCGATCA TCAAGCAGGC GGCCGCCAAG GCCGGCATCG AGATCGAGCT GAAATCGGTG ACCGGCTCGG TGTTCTTCTC CTCCGACCCG GCGAACCCCG ACACCTGCAC GCATTTCTAC GCCGACATGG AGATGTACGC CTACAGCATG ACGCAGGCCG ATCCGGCGAT CTGGCTGCTG ATGTACGCCT CCTGGGAGGT GGCCCAGAAG GCCAACAAGT GGCAGGGCCG CAACGTCGTG CGCTGGCGCA ACGACGCCTA CGACAAGGCC TACAACGCCG CCCAGAGCGA GCTCGACCCG GTCAAGCGCG CGGCGCTCCT GATCACCTGC AACGACCTCG CGGTGTCCGA GAACGTGCTG CCGCTGATCC ACCGCGCCGA GGTCTCGGCG GTGGGCGCCA CGCTCACGGC GCCCCGCAGC GGCTGGGACA ACGACCTGTC GTTCCTGCCC GACTGGTACA GGGAGGCCTG A
|
Protein sequence | MDERDLRGLI GRVKGGGLSR RAFVQRMVAL GLTAPMAGLM LAGNGVAMAA DIRSGYKPTK AGGGGALKLL WWQAPTLINP HFAIGTKDQD ASRIFYEPLA AWDADGNLVP VLAAAIPSKE NGALAADGRS VVWTLKPGVK WHDGKPLTAD DLVFTWEYAR DPATAAVTAG SYKDCKVEKV DDLSVRVLFD KPTPYWCDAF VGIVGMVLPK HLFGPYSGAK SRDAPQNLAP VGTGPYRFVE FRPGDIVRGE RNPDYHLPNR PYFDTIEMKG GGDAVSAARA VLQTGEYDYA WNMLIEDEVL KRLETGGKGR VDVVYGGKLE FLLLNATDPN VEVDGERASI TTKHPAFSDP KVCQAMNLLV DRKSIQTYIY GRTGKPTANT VNGPERFVSK NTSFAFDPAK ANALLDEAGW KKGSDGIRAK DGKKLKLVFQ TSINAPRQKT QAIIKQAAAK AGIEIELKSV TGSVFFSSDP ANPDTCTHFY ADMEMYAYSM TQADPAIWLL MYASWEVAQK ANKWQGRNVV RWRNDAYDKA YNAAQSELDP VKRAALLITC NDLAVSENVL PLIHRAEVSA VGATLTAPRS GWDNDLSFLP DWYREA
|
| |