Gene M446_3645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3645 
Symbol 
ID6133367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4064365 
End bp4066143 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content76% 
IMG OID641643812 
Productglucose/sorbosone dehydrogenase-like protein 
Protein accessionYP_001770460 
Protein GI170741805 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.683493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00853617 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCGCCC GCTTCGCCGC GCTCGAGACG CGCCGCCCCG CTTCCCGCCC GCTCCGGGCC 
GCCGCCTCCC TCGGCCTCGC GCTCGCCGCG GCCGTCCCGG CGCTCGCGGC CGAGGGCGCC
TGCCCGGGAC CCAATGCCGG GCTCTCCCTG CCGCCGGGCT TCTGCGCCAC GGTCTTCGCG
GACGATCTCG GCCATGTCCG CCAGATGGCG GTGGCGCCCG ACGGCACGCT CTACGCCAAT
ACTTGGAGCG GGTCCTACTA CAAGACCGCC GCGCCCCCGG GCGGCTTCCT GCTCGCCCTG
CGGGACCGCA CCGGGACCGG GAAGGCCGAC GCGGTCGAGC GCTTCGGCGA GAGCGCGGCC
GAGGGCGGGC ACGGCGGCAC CGGCCTCGCC CTGTTCGAGG GCGCCGTCTA CGCGGAGAGC
AACGACCGCA TCCTGCGCTA CCCGCTCGCG CCGGGGGACC TCGCGCCGAC GGCGAAGGCC
GAATTGGTCG TCTCCGGCCT GCCGCTCGGC GGCGACCACC CGATGCACCC CTTCGCGATC
ACGCCGCAGG GCGACCTCCT CGTCGATCTC GGCTCGGCCA CCAATGCCTG CGAGGTGAGG
AACCGCATGC CGGGCTCGCG CGGCCACGCC CCCTGCACCG AGAAGGAGAC GCGGGCCGGG
ATCTGGCGCT ACGACGCGCG CCGGACCGGC CAGACCTTCT CGCCCGCCGA GCGCTACGCC
ACGGGCCTGC GCAACGCCGA GGGCTTCGCC CTCGACGCCG AGGGCCGCGT CCTCGTCACC
CAGCACGGCC GCGACCAGCT GCACGAGAAC TGGCCGGCGC TCTACACCGC CCGGCAGGGA
TTCGAGCTGC CGGCCGAGGA GGTGGTCGAG CTCAAGGCCG GCGCCGATTA CGGCTGGCCC
GAATGCTACT ACGATGCCGA GCAGAGGAAG CTCGTCCTCG CTCCCGAATA CGGCGGCGAC
GGCGGCAGGA AGGTCGGGCT CTGCGCCGAC CGCCAGGGGC CGGTCGCGGC CTTCCCGGCC
CATTGGGCGC CCAACGACAT GAAGATCTAC CTCGCGGCGG GGCCGAAGGC CTTCCCGTCC
GCCTATCGGG GCGGTGCGCT CATCGCCTTC CACGGCTCCT GGAACCGCGC CCCCGGGCCG
CAGGGCGGCT ACAACGTGGT GTTCCAGCCC CTCAGGGACG GCAGGGCCGC CGGGCCGTTC
GCGGTCTTCG CGGACGGGTT CGCGGGCGCG GTCAAGGAGC CGGGCCGGGC GCAGTTCCGG
CCGACCGGCC TCGCGGTCGC CCCGGACGGG GCGCTGTACA TCTCCGACGA CGTCCGCGGC
CGGATCTGGC GCGTCACCTA CCAGGGCGGC AATCCGCAGG CGGCGATCGC CGCCGCCCCG
GAGGCCAGGC CGGCCGCCAG GGGCTCGCGG GCGGAGCTTC CCCCCGAGGG CATCCACCCG
GATGCGGGGC GCGAGGCCGC CTCCCTGCCG GTGCCCGAGG GGGCGACGCG CGAGGAGGTG
GCGCGCGGGG CGCGGATCTT CGTGGGGGAG ATCGGTGGGG CGACCTGCGG CGGCTGCCAC
GGCTCGGACG CCAAGGGCTC GCCGATCGGA CCCGACCTCA CCGACCATGA GTGGGTGTGG
AGCGACGGCA GCCTCGCGGG CATCACCCGC ACGATCGCGA GCGGCGTGCC CGAGCCCAGG
AAGCACGGCG GCGCGATGCC GCCGATGGGC GGGGTCGCGC TCTCGGAGCA GGACCTCAGG
GCCGTCGCGG CCTATGTCTG GGCGGTGGGG CACCGCTGA
 
Protein sequence
MRARFAALET RRPASRPLRA AASLGLALAA AVPALAAEGA CPGPNAGLSL PPGFCATVFA 
DDLGHVRQMA VAPDGTLYAN TWSGSYYKTA APPGGFLLAL RDRTGTGKAD AVERFGESAA
EGGHGGTGLA LFEGAVYAES NDRILRYPLA PGDLAPTAKA ELVVSGLPLG GDHPMHPFAI
TPQGDLLVDL GSATNACEVR NRMPGSRGHA PCTEKETRAG IWRYDARRTG QTFSPAERYA
TGLRNAEGFA LDAEGRVLVT QHGRDQLHEN WPALYTARQG FELPAEEVVE LKAGADYGWP
ECYYDAEQRK LVLAPEYGGD GGRKVGLCAD RQGPVAAFPA HWAPNDMKIY LAAGPKAFPS
AYRGGALIAF HGSWNRAPGP QGGYNVVFQP LRDGRAAGPF AVFADGFAGA VKEPGRAQFR
PTGLAVAPDG ALYISDDVRG RIWRVTYQGG NPQAAIAAAP EARPAARGSR AELPPEGIHP
DAGREAASLP VPEGATREEV ARGARIFVGE IGGATCGGCH GSDAKGSPIG PDLTDHEWVW
SDGSLAGITR TIASGVPEPR KHGGAMPPMG GVALSEQDLR AVAAYVWAVG HR