Gene M446_5503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5503 
SymbolmdoD 
ID6131357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6035714 
End bp6037402 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content74% 
IMG OID641645637 
Productglucan biosynthesis protein D 
Protein accessionYP_001772253 
Protein GI170743598 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.545727 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.294343 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGAC GACGCCCCGC CTCGCCCCGG CCGACCCACG CGCGAGAGAT CCAGGCCCCC 
GCCCGCGACG CGGCGGCCGG CCTCGACCGC CGCGCCGTGC TGGCGCTCGC CGGTGCCTTC
GCGGCGGGCG GCGCCCTCGC GCCGACGGGC GCCCGCGCGG AGGATCCGCC CGCGCCGCCG
GCGCTCGGGG ACGGCGAGCG CTTCGACCCG GCCGCGGTGG TGGCCCTGGC CCGGGCGCTG
GCGGCCAAGC CCTACGCGGC CCCGCCCTCC GCCCTGCCGG ACGCCTTCGG CAAGCTCACC
TACGAGCAGT ACATCGCGAT CCGGCCGAAG CCCGAACTCC TGATCTGGCG CGGGGAGGGC
CGCGGCTTCG TGGTCGAGCC CCTGCACCGG GGCTACCTGT TCACGAACCC GGTCAGCCTG
CACACGGTCG AGGACGGGAT CGTCCGGCGC ATCCCCTACG ACCGCGACCT GTTCGAGTTC
CGCAAGACCG CCCCGCCCGA GGCCGGCCGC GACCTCGGCT TCTCGGGCTT CCGCCTCTAC
GGGAGCTTCG GGGGCGGGCC GCCCGCCGAT TGCGCGATCT TCCAGGGCGC CTCGTTCTTC
CGGGCCCTCG CCGCGGGCCA GAGCTACGGC ATCACCGCGC GCGCCCTCAC CCTGCGCCCG
GCCGAGGCGC GGGGCGAGGA ATTCCCGCTC TTCCGGGCCT TCTGGCTGGA GCGGCCGGGC
CCGGCCAGCA CCCAGATCGT GATGCACGCG CTGATCGATT CCGACTCCGC CGCCGCGGCC
CTGCGCATGA CGCTGCGCCC GGGCGACGCC ACCATCGTGG ACATCGAGGC GACCCTGTGC
CCGCGCACCA GCCTCGACCA TGTCGGGCTC GGCGGCATGA CGGGCGCCTA CCTGTTCGGC
CCGGCCGACC ACCGCAACAC CGACGACGCC CGCCCCGCCG CCCACGAGAT CGGCGGCCTG
CAGATCCGCA ACGGCGGGGG CGAGGCGCTC TGGCGGCCCG TGCAGAACCC CGAGACGCTG
CAGATCTCCT CCTTCCTCGA CCAGGACCCG AAGGGCTTCG GCCTCGTGCA GCGCCACCGC
GACTACGCGG ATTACCAGGA CGACGTGCAG CACTGGGAGC GCCGCCCGAG CCTGTGGATC
GAGCCCTTCG GCACCTGGCA GCCCGGCCAG GAGCAGCGCG GCTGGGGCCC GGGCGCGGTC
CAGCTCATCG AGATCCCGAG CGAGTCCGAG GTCAACGAGA ACATCCTGGC CTATTGGCGG
CCCAAGGAGC CGATCGCCAG GGAGACCGCT TTCACCTTCC GGCAATACTG GTGCTGGGAC
GTGCCGGGGG CGCCGCCCCT GGCGCGCTGC ACCGGCACGC GGATCGGCAA GGGCAGTTCG
GGCAAGCGCC GCCTCTTCCT CGTCGACTTC GCGGGCGACG CGCTCTTCGC CCCGGAGGTG
AGGCCCGAAT CCCTGAAGCT CGCGGTCACG GCCTCGCCGG GGACGATCGT GCAGCCCAAG
CCCGACGCGC CGCCGGTGAA GCGCGACCCG CAATTCCGCC CCCTCCTCTA CCTCTATCCC
GAGCGCAAGA CGGTGCGCGC CGTCTTCGAA CTGGACCCGG GCGGCGAGAC CGCCTGCGAG
ATGCGCCTCG TCGTCAAGAA CGGCGACCAA CCCGTCAGTG AGACATGGCT GTTTCGTTGG
ACCCCGTGA
 
Protein sequence
MTRRRPASPR PTHAREIQAP ARDAAAGLDR RAVLALAGAF AAGGALAPTG ARAEDPPAPP 
ALGDGERFDP AAVVALARAL AAKPYAAPPS ALPDAFGKLT YEQYIAIRPK PELLIWRGEG
RGFVVEPLHR GYLFTNPVSL HTVEDGIVRR IPYDRDLFEF RKTAPPEAGR DLGFSGFRLY
GSFGGGPPAD CAIFQGASFF RALAAGQSYG ITARALTLRP AEARGEEFPL FRAFWLERPG
PASTQIVMHA LIDSDSAAAA LRMTLRPGDA TIVDIEATLC PRTSLDHVGL GGMTGAYLFG
PADHRNTDDA RPAAHEIGGL QIRNGGGEAL WRPVQNPETL QISSFLDQDP KGFGLVQRHR
DYADYQDDVQ HWERRPSLWI EPFGTWQPGQ EQRGWGPGAV QLIEIPSESE VNENILAYWR
PKEPIARETA FTFRQYWCWD VPGAPPLARC TGTRIGKGSS GKRRLFLVDF AGDALFAPEV
RPESLKLAVT ASPGTIVQPK PDAPPVKRDP QFRPLLYLYP ERKTVRAVFE LDPGGETACE
MRLVVKNGDQ PVSETWLFRW TP