Gene M446_1913 details

Gene Information

Locus tag: M446_1913
Symbol:
ID: 6134520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 2137687
End bp: 2138796
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 64%
IMG OID: 641642152
Product: glucose sorbosone dehydrogenase
Protein accession: YP_001768820
Protein GI: 170740165
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
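
The start, end, gene length, and protein length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a protein length that excludes the terminal stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


n_bp = gene_length(2137687, 2138796)
print(n_bp, protein_length(n_bp))  # prints: 1110 369
```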


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATCTA TCCTTCTCGC CACTGCCACG CTCGTGGTGA CTTCCCTCCC TGTTGCGGGT 
CAACCAGCGC AGCGCCTCAG AACAGAGAAG GCCGAGATCA TCGTCGAAAC CGTCGCGGGC
GGTCTCAACC ATCCCTGGGG TCTCGCCTTC CTGCCGGATG GACGCATGTT GGTCACCGAA
AAGCCGGGGC GCCTGCGCAT CGTCTCGGCC GAAGGGGAGA TCTCACCTCC GATCGCCAAG
ACGCCGCAGC CGTCCATTCA GTTTTTGGAC GTAGCACTCG ATCCCAATTT CTCTGAGAAC
CAGCTCGTTT ACCTTACGTA TGTCGAGCCG CGTGGGGGCG GCTTGGCCAC GGCGGCAGGA
CGTGGGCGGC TCAGCACGAC CGGTACGACT TTGGAGGGCT TCGAGGTCAT CTTTCGGCAG
CAGCCGGCCT CGCCGATCGA GGATCACTTC GGATCGCGCC TCGCCTTCAC GCCCGACAGC
AAGCTCTTCA TCTCGACAGG AGACCGTGAC GAGCCTGACT CGGCTCAGGA TCTCTCCACC
CACATGGGCA AGCTCGTCCG CGTCAACCGG GACGGCTCCG TGCCGGCCGA CAACCCATTC
GTGCATCGTG CAGGAGTTCG GCCAGAGATC TGGTCCTACG GCCATCGAAA CATCGAGGGC
CTCGCCGTCC AGCCGGGTAC AGGCGTCCTC TGGGCGGGGG AGTTCGGGCC GACCGGCGGA
GATGAAATCA ACATTCCCAA GCCGGGCGGC AACTACGGTT GGCCCTTAGT GAGCTGGGGT
GATCACACGG ATGGGCGCGT GATCCCGCGG CCGCCGACCC GGCCTGACCT GACGGACGCC
ATTTATCACT GGACACCATC GGTCTCGTTC TCTGGGATGA CGTTCTACAC GGGGGCTGCG
TTTCCGGCCT GGCATGGAAA CCTGCTGCTG GCTGGACTGG CTTCACAGGC CTTGATCCGT
CTGACGCTCG CCGGGGCACG TGTCACTGGG GAGGAGCGCA TCCCGATGGA CGCACGCATC
CGGCATGTTG CCCAAGGACG GGATGGCCTT CTCTACCTTC TGACCGACGA GGACCAGGGG
CGGATCCTAC GTTTCAAGCC GGGCGGCTAA
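
The record reports a GC content of 64% for this gene. A value of that kind can be recomputed from the sequence with a sketch like the one below. It assumes GC content means the percentage of G and C bases over all bases, and it ignores the spaces and line breaks between the 10-base blocks. `gc_content` is an illustrative name, and how the database itself computes or rounds the figure is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# First row of the gene sequence above, pasted as-is; paste the full
# sequence instead to compare against the reported whole-gene value.
first_row = "ATGCGATCTA TCCTTCTCGC CACTGCCACG CTCGTGGTGA CTTCCCTCCC TGTTGCGGGT"
print(round(gc_content(first_row), 1))
```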
 
Protein sequence
MRSILLATAT LVVTSLPVAG QPAQRLRTEK AEIIVETVAG GLNHPWGLAF LPDGRMLVTE 
KPGRLRIVSA EGEISPPIAK TPQPSIQFLD VALDPNFSEN QLVYLTYVEP RGGGLATAAG
RGRLSTTGTT LEGFEVIFRQ QPASPIEDHF GSRLAFTPDS KLFISTGDRD EPDSAQDLST
HMGKLVRVNR DGSVPADNPF VHRAGVRPEI WSYGHRNIEG LAVQPGTGVL WAGEFGPTGG
DEINIPKPGG NYGWPLVSWG DHTDGRVIPR PPTRPDLTDA IYHWTPSVSF SGMTFYTGAA
FPAWHGNLLL AGLASQALIR LTLAGARVTG EERIPMDARI RHVAQGRDGL LYLLTDEDQG
RILRFKPGG
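
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below shows one way to reproduce that translation in plain Python. It relies on table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, and the code raises `KeyError` on any base other than A, C, G, or T.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acids of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence above.
print(translate("ATGCGATCTA TCCTTCTCGC CACTGCCACG"))  # MRSILLATAT
print(translate("CGGATCCTAC GTTTCAAGCC GGGCGGCTAA"))  # RILRFKPGG
```

Passing the full gene sequence to `translate` in the same way should return the complete protein for comparison with the listing above.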