Gene M446_4893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4893 
SymbolmdoD 
ID6132290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5372914 
End bp5374470 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content74% 
IMG OID641645029 
Productglucan biosynthesis protein D 
Protein accessionYP_001771656 
Protein GI170743001 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00109342 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCTCGACC GCCGCCGCTT CCTCGCCGCC GCCGCGCTCC TGGCCGGCGG CCTCCCCGCC 
CGCGCGGCCG GCCTGCCGCT CGGGGCGCCG AGCCCCTTCG ACTTCGAGGC GCTGAAGGCC
CGCGCCCGCG ACCTCGCCGC CGCGCCCTAC CGGCCCCCGG CGATCCCCGA CCGGGAGGTG
CTGCAGGCGA TCGACTACGA CGCCCACGGC AAGCTGCGCT TCAAGCCCGA CCACGCCCTC
TGGGCCGAGG GCCCGGGCGC CTTCCCGGTG ACCTTCTTCC ATCTCGGCCG CTACTTCCAG
AAGCCGGTGC GGATGCACCT GGTCGAGGGG GGCGAGGCCC GCGAGATCGT CTACGTCCAG
GACGCCTTCG AGATGCCGGC GGATTCGCCC GCCCGGCGCC TGCCGCCGAA TCCCGGCTTC
GCGGGTTTCC GCTTCCAGGA GCGGCGCGGC GGCGCCCTCG ACTGGCGGCG CAACGACTGG
GTGGCGTTCC TCGGCGCCTC CTATTTCCGG GCGATCGGCG AACTCTACCA GTACGGACTC
TCGGCGCGGG GCCTCGCCCT CGACACCGTG ATGCCGGACC GGCCGGAGGA GTTTCCCGAC
TTCACTCATG TCTGGTTCGA GACGCCGGCG CCGGATTCCG ACACCGTCAC GGTGATGACG
CTCCTCGACG GCCCCTCGGT GGCGGGCGCC TACCGGTTCC GGATGCGGCG CGGCAAGGCC
GTGGTGATGG AGATCGAGGC GCGGCTGCAC CTGCGCCGGG ACGTCGGCCG CTTCGGGCTG
GCGCCGCTCA CCTCGATGTA CTGGTTCTCC GAGACGGCCA AGCCCAGCGC CGTCGACTGG
CGCCCCGAGG TGCACGATTC GGACGGGCTG GCCCTGTGGA CCGGGAGCGG CGAGCGCCTC
TGGCGGCCCC TGCGCAACCC GCCCCGGACC ATGGTCTCGG CCTTCGTGGA CGCGCGGCCG
CGCGGCTTCG GGCTGATGCA GCGCGACCGC CTGTTCGACC ATTACCAGGA CGGGGTCTAC
TACGACCGCC GGCCCTCGCT CTGGGTCGAG CCGCTCGGGG ATTGGGGCCG GGGCAGCGTG
CAGCTCATCG AGAACCCGAC CGACGACGAG ATCCACGACA ACGTCGTGGC CATGTGGGTG
CCGGAGGAGC CGGCCCGGGC CGGCTCCGTG CACGACCTCG CCTACCGGCT GCACTGGGTG
GCCGACGAGC CCTATCCGTC CGCGCTCGCG CGCTGCGTCG CGACCCGCGA GGGCAATGGC
GGGCAGGCCG GGACGGAGCG CCCGAAGGGC CTGCGCAAGT TCGTGGTGGA GTTCCTGGGC
GGGCCGCTGG CGCAGCTCCC CGCGGGCGTG AAGCCCGAGC CGGTGCTCAG CGCCTCCCGC
GGCAGCTTCC CGCTCGCCCG CACGGAGGCG GTGCCGGACG ACGTGCCGGG CCATTGGCGC
GCCGAGTTCG ACCTCGCGGT CACCGGGTCC GAGCCGGTGG AATTGCGGCT CTTCCTGCGC
CAGGGCGACC GCACGCTCAG CGAGACCTGG ACCTACCAGG TCATCCCGGC GGCCTGA
 
Protein sequence
MLDRRRFLAA AALLAGGLPA RAAGLPLGAP SPFDFEALKA RARDLAAAPY RPPAIPDREV 
LQAIDYDAHG KLRFKPDHAL WAEGPGAFPV TFFHLGRYFQ KPVRMHLVEG GEAREIVYVQ
DAFEMPADSP ARRLPPNPGF AGFRFQERRG GALDWRRNDW VAFLGASYFR AIGELYQYGL
SARGLALDTV MPDRPEEFPD FTHVWFETPA PDSDTVTVMT LLDGPSVAGA YRFRMRRGKA
VVMEIEARLH LRRDVGRFGL APLTSMYWFS ETAKPSAVDW RPEVHDSDGL ALWTGSGERL
WRPLRNPPRT MVSAFVDARP RGFGLMQRDR LFDHYQDGVY YDRRPSLWVE PLGDWGRGSV
QLIENPTDDE IHDNVVAMWV PEEPARAGSV HDLAYRLHWV ADEPYPSALA RCVATREGNG
GQAGTERPKG LRKFVVEFLG GPLAQLPAGV KPEPVLSASR GSFPLARTEA VPDDVPGHWR
AEFDLAVTGS EPVELRLFLR QGDRTLSETW TYQVIPAA