Gene M446_5253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5253 
Symbol 
ID6130234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5770542 
End bp5772110 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content70% 
IMG OID641645388 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001772011 
Protein GI170743356 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.789001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00462633 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCCGGAT TCGACCTGAA CGACGACGGG CTCGTCGTCA TCGTGGGCTC GGGCGCGGGC 
GGCGGCACGC TCGGCACCGA ATTGGCGCTC AAGGGCATCA GGACCGTGAT CCTGGAGGCC
GGCGCCCGGC ACAACATGGA GGATTTCGTC AACGACGAGT GGGCGAGCTT CGCCCAGCTC
GCCTGGACGG ACATGCGCAC CACCTCCGGC TCCTGGCGGG TGGCGCGGGA CTTCCCCAAC
CTGCCGGCCT GGATCGTCAA GGCGGTGGGC GGCTCGACCG TGCACTGGGC CGGCGCCTCG
CTGCGCTTCG AGGAGCACGA GTTCCGCATC CGCGACCATT ACGGCGCGAT CCCGGGCGCC
AACCTGCTCG ACTGGCCGAT CACCCGGGCC GAACTCGACC CCTGGTACGA GAAGGCCGAG
GACCGGATGG GCGTGACCCG CACCAACGGC ATCCCGGGCC TGCCCGGCAA CAACAACTAC
AAGGTGCTTG AGGCCGGCGC CCGCCGCCTC GGCTACCGGG AGGTCCATAC CGGCCGGATG
GCGATCAACA GCCAGGAGCG CCACGACCGC GGCGCCTGCC AGCAGATCGG CTTCTGCTTC
CAGGGCTGCA AGTCGGGGGC GAAGTGGTCG ACGCTGATCG CCGAGATCCC CCGCGGCGAG
GCGACCGGCA ACCTGGAGGT CCGCCCGGGC TGCATGGCGA TCCGCATCGA GCACGATGCC
TCCGGCAAGG TGACGGGCGT CGTCTACGCG GACGAGACCG GAACGCTGCA GCGCCAGAAG
GCCCGCATCG TCGCGGTGGC CGGCAACTCG ATCGAGAGCC CGCGCCTCCT CCTCAACAGC
GCCTCGTCCC TGTTCCCGGA CGGGCTCGCC AATTCCTCCG GGCAGGTCGG CCGCAACTAC
ATGCGGCACA TGACCGGCAG CGTCTACGGC GTGTTCGAGA AGTCGGTCCA CATGTACCGC
GGCACCACCA TGGCGGGCAT CATCCGCGAC GAGGCGCGCC ACGACCCGTC GCGCGGCTTC
GCGGGCGGCT ACGAGATGGA GACGCTCTCC CTCGGCCTGC CCTTCATGGC GGCCTTCCTC
AACCCGGGCG CCTGGGGGCG CAGCTTCACC AGCGCCATGG AGCAGTATCC GCGCATGGCC
GGGATGTGGC TCGTCGGCGA GGACATGCCC CAGGAGACGA ACCGCATCAC CCTCGACCCG
GTGCAGAAGG ACGCGCACGG GATGCCGGTC GCGCACGTCC ACTTCGACGA CCACCCGAAC
GACATCGCGA TGCGCGACCA CGCCTACCGG CAGGGCGCGG CGGTCTACGA GGCGGTGGGC
GCGACCGTCA CCTACCCGAC CCCGCCCTAT CCGAGCACCC ACAACATGGG CACCAACCGC
ATGAGCGCGC GGCCGCGCGA CGGCGTCGTG AACAAGTTCG GCCAGACCCA CGACGTCGGG
AACCTGTTCG TCTCGGACGG CAGCCAGTTC ACCAGCGGCG CGGCCTGCAA CCCGACGCTG
ACCATCGTGG CCCTGGCGCT GCGGCAGGCG GACCACATCG CGGGCGCGAT GCAGCGGCGG
GAGATCTGA
 
Protein sequence
MAGFDLNDDG LVVIVGSGAG GGTLGTELAL KGIRTVILEA GARHNMEDFV NDEWASFAQL 
AWTDMRTTSG SWRVARDFPN LPAWIVKAVG GSTVHWAGAS LRFEEHEFRI RDHYGAIPGA
NLLDWPITRA ELDPWYEKAE DRMGVTRTNG IPGLPGNNNY KVLEAGARRL GYREVHTGRM
AINSQERHDR GACQQIGFCF QGCKSGAKWS TLIAEIPRGE ATGNLEVRPG CMAIRIEHDA
SGKVTGVVYA DETGTLQRQK ARIVAVAGNS IESPRLLLNS ASSLFPDGLA NSSGQVGRNY
MRHMTGSVYG VFEKSVHMYR GTTMAGIIRD EARHDPSRGF AGGYEMETLS LGLPFMAAFL
NPGAWGRSFT SAMEQYPRMA GMWLVGEDMP QETNRITLDP VQKDAHGMPV AHVHFDDHPN
DIAMRDHAYR QGAAVYEAVG ATVTYPTPPY PSTHNMGTNR MSARPRDGVV NKFGQTHDVG
NLFVSDGSQF TSGAACNPTL TIVALALRQA DHIAGAMQRR EI