Gene Mlg_0417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0417 
Symbol 
ID4269456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp466704 
End bp467834 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content71% 
IMG OID638125147 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_741261 
Protein GI114319578 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000311026 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.00936904 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTGCAGAA GCAGCACCCT CATCCTGATC GGCCTCGGGT TCACCGCCCT GACCGCCCCC 
ACGCTGGCAC CGGCCACCGA CGTTGAGAAC AGCCAGCACC ACCGCTTCGA GATCGAACGC
CTGGGCCAGG GCTTCAGCCA CCCCTGGGGG CTCGCCTTCC TGCCCGACGG CGACCTGCTG
GTCACCGAGC GCCCGGGACG GCTGCAGCGC GTCGACGCCG GGACCGGTGA GCGCCGGCGT
ATCGAGGGCA CCCCGGACGT CGCCGCCACC GGCCAGGGCG GTATGCTCGA CATCGCCCTG
CACCCGGACT TCGACACCAA CCGCTACGTC TACCTCACCT ACTCCGCCTA CGGCCGCGGC
GGCATGACCA CCCACCTGGG CCGCGGTGTG CTGGATGGCG ACACCCTGCG TGACTTCGAG
CTGCTGTACG CGGCCACCCC CTACTCCGGC GGCGGCCGCC ACTTCGGCTC GCGGATCGTC
TTTGACGACG ACGGCTATCT CTTTATGACC ATGGGCGACC GCGGCCGGCG CGAGCGCGCA
CAGCAGTTGG ACAACCACCA CGGCAAACTG CTGCGGCTGC ACGACGACGG CGGCATCCCC
GCGGACAACC CCTTCGTGGA TGACGAGGGC GCCGAGCCCG CCATCTACAG CTACGGCCAC
CGCAACGCCC AGGGCATGAC CCTGCACCCG GAGACCCGGG TGCTCTGGCT GCACGAACAC
GGCCCGCGCG GCGGTGACGA GATCAACCTG CCGCGCCCGG GCCTCAACTT CGGCTGGCCG
GAGGCCACCT TCGGTACCGA GTACCACGGC CCGGAGATCG CCCCCGACCC GCCGGTGGCA
GGCATGGAAC CCCCCATCCA CCACTGGACG CCCTCCATCG CCCCCTCCGG CATGGCCTTC
TACTACGCCG ACGCCTTCCC GGAGTGGCAG GGTGATCTGT TCGTCGGGGC CCTGGCCCAC
CGCCACCTGG AACGGTTGCG CCTGGACGGC ACCGACGTGG TGGAGCAGGA GCGCCTGCTG
CAAGGACTCG GCTGGCGCAT CCGCGACGTG CGGGTCGGTC CCAAGGGCCA TCTCTACGTC
CTCCCGGACC GCAGTAGCAC GCCCCTCTTG CGGCTCCGCC CCGCCGACTG A
 
Protein sequence
MCRSSTLILI GLGFTALTAP TLAPATDVEN SQHHRFEIER LGQGFSHPWG LAFLPDGDLL 
VTERPGRLQR VDAGTGERRR IEGTPDVAAT GQGGMLDIAL HPDFDTNRYV YLTYSAYGRG
GMTTHLGRGV LDGDTLRDFE LLYAATPYSG GGRHFGSRIV FDDDGYLFMT MGDRGRRERA
QQLDNHHGKL LRLHDDGGIP ADNPFVDDEG AEPAIYSYGH RNAQGMTLHP ETRVLWLHEH
GPRGGDEINL PRPGLNFGWP EATFGTEYHG PEIAPDPPVA GMEPPIHHWT PSIAPSGMAF
YYADAFPEWQ GDLFVGALAH RHLERLRLDG TDVVEQERLL QGLGWRIRDV RVGPKGHLYV
LPDRSSTPLL RLRPAD