Gene Mlg_2737 details

Gene Information

Locus tag: Mlg_2737
Symbol:
ID: 4270991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 3105865
End bp: 3107112
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 70%
IMG OID: 638127499
Product: glucose sorbosone dehydrogenase
Protein accession: YP_743567
Protein GI: 114321884
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
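
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3105865, 3107112)
print(length, protein_length_aa(length))  # prints: 1248 415
```

Both values match the Gene Length and Protein Length fields of the record.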


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.417309
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCAGC CTGGTTCATT CCTTACGACC CCAGCCCGCT GCGTAGTCGG GGCAGCGGTC 
CCAGCGGTCT TGCTGGTGAC CAGCCTGTCG GCGCTGGCGG AGGTCAGTCG CAGTGACCAG
GTGGTGGCCG CCGGGGTCGA GAGCGAGCAG GCCCGCTTTG ATGTCGTCCA GGTGCTGGAG
GGGTTGGAGC ACCCCTGGGC GGTGGCCTGG CTGCCCGATG GCCGCAAGCT CGTCACCGAA
CGCCCCGGCC GCCTCTGGCT GGTGGATGGC GATGACATCA CCCGCGTGGG TAATACCCCC
CGGGTGAACC CCGAGGAGCG CGACGGTCTG GCCCTGGAGG CGAGCTGGCA GGGCGGCCTG
CTCGATCTGG CGGTCCACCC CGATTACGAG GACAACGGCT GGATCTACAT GACCTATTCC
AGCCCGGGGG ATCCGGACGC GGTCATCGGC GACGCCGAGT TTGGCAGCGG CACCGCCCTG
GCCCGGGCGC GGTTGAGCGA CGACGGCAGC CAGCTCACCG ACCTGGAGAC GCTGTACGTG
CAGATGCCCC GCACCGCCCC GGGCCGGCAC TACGGCTCGC GGATCGTCTT TCCGGGCGAC
GGCACCGTGA TCTTCTCCAT CGGCGACAGC GGCCTGCGCG CCCCCTCGCA GGATCTGACC
GATCCGGCCG GGTCCATGAT CCGCCTCAAC GAGGATGGTG GCGCCGCCGA GGACAACCCG
CTGGTGGGCA TGGCGCCGGG CAACCTGCGG CCGGAGATCT ACTCCTTCGG TCACCGCAAC
AACCAGGGCC TGGCCATCCA CCCGGAGACC GGTGAGATCT GGACCAGCGA GCACGGGCCC
CGGGGCGGCG ACATGATCCA CCGGATCGAG CCCGGCAACA ACTACGGCTG GCCCCAAGTG
GCCTACGGCA CCGAGTACTC CACCGACGAG CAGGTCGGCA TTGGCCGGTC CGCCCCCGGC
GTGACCCCGG CGGTCCACTA CTGGGACTAC TCTATGGCCC CCTCAGGGCT CGCCTTCTAC
AGCGGTGATG AGGTGCCGGG CTGGCAGGGC GATCTGTTTG CCGGGTCGCT GGCCGAGGAG
CGCTTGCACC GGCTGGTGCT GGAGGGCGAC CGCGTGGTCC ACGAGGAGCT CCTGCTCGAC
GGCACCCTCG GGCGCATCCG GGATGTGCGG CAGGGGCCGG ACGGCCGGCT CTACCTGCTC
ACGGATGAGG AGTCGGGGGG GCTCTATCGG TTGGAGCCCG CCCACTGA
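
The gene sequence above is laid out in blocks of 10 bases, so the spaces and line breaks have to be removed before the length or GC content can be checked against the record. A minimal sketch, assuming the sequence has been copied into a plain string; the helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, as displayed in the record.
first_line = "ATGCAGCAGC CTGGTTCATT CCTTACGACC CCAGCCCGCT GCGTAGTCGG GGCAGCGGTC"
seq = clean_sequence(first_line)
print(len(seq))                    # prints: 60
print(f"{gc_fraction(seq):.0%}")   # GC content of this line only
```

Applied to the full sequence, `len(seq)` should equal the Gene Length field (1248), and `gc_fraction(seq)` can be compared with the reported GC content.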
 
Protein sequence
MQQPGSFLTT PARCVVGAAV PAVLLVTSLS ALAEVSRSDQ VVAAGVESEQ ARFDVVQVLE 
GLEHPWAVAW LPDGRKLVTE RPGRLWLVDG DDITRVGNTP RVNPEERDGL ALEASWQGGL
LDLAVHPDYE DNGWIYMTYS SPGDPDAVIG DAEFGSGTAL ARARLSDDGS QLTDLETLYV
QMPRTAPGRH YGSRIVFPGD GTVIFSIGDS GLRAPSQDLT DPAGSMIRLN EDGGAAEDNP
LVGMAPGNLR PEIYSFGHRN NQGLAIHPET GEIWTSEHGP RGGDMIHRIE PGNNYGWPQV
AYGTEYSTDE QVGIGRSAPG VTPAVHYWDY SMAPSGLAFY SGDEVPGWQG DLFAGSLAEE
RLHRLVLEGD RVVHEELLLD GTLGRIRDVR QGPDGRLYLL TDEESGGLYR LEPAH