Gene Mrad2831_4579 details


Gene Information

Locus tag: Mrad2831_4579
Symbol:
ID: 6140646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium radiotolerans JCM 2831
Kingdom: Bacteria
Replicon accession: NC_010505
Strand:
Start bp: 4874295
End bp: 4876085
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 69%
IMG OID: 641630294
Product: extracellular solute-binding protein
Protein accession: YP_001757228
Protein GI: 170750968
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
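
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `check_gene_lengths` is hypothetical and not part of any database tool.

```python
def check_gene_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumptions (not stated in the record itself):
    - start_bp and end_bp are 1-based, inclusive coordinates
    - the CDS ends with one stop codon that is not part of the protein
    """
    span = end_bp - start_bp + 1      # bases covered by the coordinates
    codons = gene_length_bp // 3      # codons, including the stop codon
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and codons - 1 == protein_length_aa
    )


# Values taken from the record above.
print(check_gene_lengths(4874295, 4876085, 1791, 596))  # True
```

Here 4876085 - 4874295 + 1 = 1791 bp, and 1791 / 3 = 597 codons, that is, 596 amino acids plus one stop codon.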


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAGC GGGATCTGCG CGGCCTCATC GGGCGGGTGA AGGGCGGTGG CCTGTCGCGG 
CGCGCCTTCG TGCAGCGGAT GGTGGCCCTG GGCCTCACCG CGCCCATGGC CGGGCTGATG
CTGGCCGGGA ACGGTGTCGC GATGGCGGCC GATATCCGGT CCGGCTACAA GCCGACCAAG
GCCGGCGGCG GCGGCGCGCT CAAGCTGCTC TGGTGGCAGG CGCCGACGCT GATCAACCCG
CATTTCGCCA TCGGCACCAA GGACCAGGAC GCCTCGCGGA TCTTCTACGA GCCGCTCGCC
GCCTGGGACG CGGACGGCAA CCTCGTGCCG GTTCTGGCCG CCGCGATCCC GTCCAAGGAG
AACGGGGCGC TCGCCGCCGA CGGCCGCTCG GTGGTCTGGA CGCTCAAGCC CGGCGTGAAG
TGGCACGACG GCAAGCCGCT CACCGCCGAC GACCTCGTCT TCACCTGGGA GTACGCCCGC
GACCCCGCCA CCGCGGCGGT GACGGCCGGG TCCTACAAGG ATTGCAAGGT CGAGAAGGTC
GACGACCTCA GCGTCCGGGT GCTGTTCGAC AAGCCGACGC CCTACTGGTG CGACGCCTTC
GTGGGCATCG TCGGGATGGT GCTGCCGAAG CACCTGTTCG GCCCGTACAG CGGCGCCAAG
TCCCGGGACG CGCCGCAGAA CCTCGCCCCC GTGGGCACCG GCCCGTACCG GTTTGTGGAG
TTCCGGCCCG GCGACATCGT CCGCGGGGAG CGCAACCCGG ATTACCACCT GCCGAACCGG
CCGTATTTCG ACACGATCGA GATGAAGGGC GGCGGGGACG CGGTCTCGGC CGCCCGCGCG
GTGCTCCAGA CCGGCGAGTA CGACTACGCC TGGAACATGC TGATCGAGGA CGAGGTGCTC
AAGCGCCTGG AGACCGGCGG CAAGGGCCGG GTCGACGTGG TCTACGGCGG CAAGCTCGAG
TTCCTGCTCC TCAACGCCAC CGACCCGAAC GTCGAGGTCG ACGGCGAGCG CGCCTCGATC
ACGACGAAGC ACCCCGCCTT CTCCGACCCG AAGGTGTGCC AGGCGATGAA CCTGCTGGTC
GACCGCAAGT CGATCCAGAC CTACATCTAC GGCCGCACCG GCAAGCCCAC CGCCAACACG
GTCAACGGCC CGGAGCGCTT CGTCTCCAAG AACACGAGCT TCGCCTTCGA CCCGGCCAAG
GCCAACGCGC TCCTCGACGA GGCCGGCTGG AAGAAGGGCT CCGACGGCAT CCGCGCCAAG
GACGGCAAGA AGCTGAAGCT CGTCTTCCAG ACCTCGATCA ACGCGCCGCG CCAGAAGACC
CAGGCGATCA TCAAGCAGGC GGCCGCCAAG GCCGGCATCG AGATCGAGCT GAAATCGGTG
ACCGGCTCGG TGTTCTTCTC CTCCGACCCG GCGAACCCCG ACACCTGCAC GCATTTCTAC
GCCGACATGG AGATGTACGC CTACAGCATG ACGCAGGCCG ATCCGGCGAT CTGGCTGCTG
ATGTACGCCT CCTGGGAGGT GGCCCAGAAG GCCAACAAGT GGCAGGGCCG CAACGTCGTG
CGCTGGCGCA ACGACGCCTA CGACAAGGCC TACAACGCCG CCCAGAGCGA GCTCGACCCG
GTCAAGCGCG CGGCGCTCCT GATCACCTGC AACGACCTCG CGGTGTCCGA GAACGTGCTG
CCGCTGATCC ACCGCGCCGA GGTCTCGGCG GTGGGCGCCA CGCTCACGGC GCCCCGCAGC
GGCTGGGACA ACGACCTGTC GTTCCTGCCC GACTGGTACA GGGAGGCCTG A
 
Protein sequence
MDERDLRGLI GRVKGGGLSR RAFVQRMVAL GLTAPMAGLM LAGNGVAMAA DIRSGYKPTK 
AGGGGALKLL WWQAPTLINP HFAIGTKDQD ASRIFYEPLA AWDADGNLVP VLAAAIPSKE
NGALAADGRS VVWTLKPGVK WHDGKPLTAD DLVFTWEYAR DPATAAVTAG SYKDCKVEKV
DDLSVRVLFD KPTPYWCDAF VGIVGMVLPK HLFGPYSGAK SRDAPQNLAP VGTGPYRFVE
FRPGDIVRGE RNPDYHLPNR PYFDTIEMKG GGDAVSAARA VLQTGEYDYA WNMLIEDEVL
KRLETGGKGR VDVVYGGKLE FLLLNATDPN VEVDGERASI TTKHPAFSDP KVCQAMNLLV
DRKSIQTYIY GRTGKPTANT VNGPERFVSK NTSFAFDPAK ANALLDEAGW KKGSDGIRAK
DGKKLKLVFQ TSINAPRQKT QAIIKQAAAK AGIEIELKSV TGSVFFSSDP ANPDTCTHFY
ADMEMYAYSM TQADPAIWLL MYASWEVAQK ANKWQGRNVV RWRNDAYDKA YNAAQSELDP
VKRAALLITC NDLAVSENVL PLIHRAEVSA VGATLTAPRS GWDNDLSFLP DWYREA
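
Both sequences are printed in blocks of 10 characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation for nucleotide input. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as example input.
first_line = "ATGGACGAGC GGGATCTGCG CGGCCTCATC GGGCGGGTGA AGGGCGGTGG CCTGTCGCGG"
seq = clean_sequence(first_line)
print(len(seq))                        # number of bases on this line
print(round(gc_fraction(seq) * 100))   # GC percentage for this line only
```

Applied to the full gene sequence, the cleaned string should contain 1791 bases, matching the Gene Length field.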