Gene Mlg_0377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0377 
Symbol 
ID4269002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp421933 
End bp423048 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content69% 
IMG OID638125108 
Product3,4-dihydroxy-2-butanone 4-phosphate synthase / GTP cyclohydrolase II 
Protein accessionYP_741222 
Protein GI114319539 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.0664139 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTTCA ACAGCATCGA CGAGATCATC GAGGACCTGC GCGAAGGGCG CATGGCGGTC 
ATCCTGGACG ACGAGGACCG GGAGAACGAG GGCGACCTGG TCATGGCGGC CAGCATGGTC
CGGCCCGACG ACATCAACTT CATGGCCCGC TACGGCCGCG GTCTCATCTG CCTGACCCTC
ACCCGCGAGC GCTGCGAGCA GCTCCGGCTG CCGCTCATGG TCCAGGACAC CGAGCAGGCC
CAGTCCACCA ACTTCACCGT CTCCATCGAG GCCGCCACCG GCGTGACCAC CGGCATCTCC
GCCGCGGACC GGGCGCGCAC CGTGCAGGCC GCGGTGGCGC CGCACGCAAA GCCCGAGGAC
CTGGTCCAAC CGGGGCACAT CTTCCCGCTC ATGGCCCAGC CCGGCGGCGT GCTCACCCGC
GCCGGGCACA CCGAGGCCGG CTGCGACCTG GCCCGCCTGG CCGGTTTCGA GCCCTCGGCG
GTGATCGTCG AGATCCTCAA AGAGGACGGC ACCATGGCCC GGCGCGACGA CCTGATGGCC
TTCGCCCGCG AGCACAACCT GAAGATCGGC ACGGTCGCCG ACCTGATCGC CTACCGCGTC
CGCAACGAGC GCTCGGTGGA ACGGGTGGGC GAATGCGACC TGCCCACCGA GCACGGCATC
TTCCACCTCT ACGCCTACCA GGACAACGTG GACAATGCCC TGCACTTCGC CCTGGTCAAG
GGCCGGCCCC AGCCGGACAC CCCCACCCTG GTCCGGGTGC ACGTCCAGAA CACCCTCTCC
GACGTCTTCG CCAGCGATGG GCCGCACTGC GGTTGGCCGC TGCGCGCCGC CATGCGCCAG
GTGGCCGAGG CCGGGGAGGG CGTGGTGGTG GTGCTGCGCC GGCGCGACGA CAGCGACGAC
ATCCTCAAGC GCATGCGCGC CTACCAGGTG CAGGCCAGTC AGGCCGACGA GAGCGAGGAG
GCGCGCTCCA GCAGTGATCT GCGCACCTAC GGCCTGGGCG CGCAGATCCT CACCGACGTG
GGCGTGCGCC GTATGCGCGT ACTGTCCGCC CCCAAGCGTA TGCACGCCAT TTCCGGCTTC
GGCATGGAAG TGGTGGAATA CGTAGAGCCT GAATAG
 
Protein sequence
MAFNSIDEII EDLREGRMAV ILDDEDRENE GDLVMAASMV RPDDINFMAR YGRGLICLTL 
TRERCEQLRL PLMVQDTEQA QSTNFTVSIE AATGVTTGIS AADRARTVQA AVAPHAKPED
LVQPGHIFPL MAQPGGVLTR AGHTEAGCDL ARLAGFEPSA VIVEILKEDG TMARRDDLMA
FAREHNLKIG TVADLIAYRV RNERSVERVG ECDLPTEHGI FHLYAYQDNV DNALHFALVK
GRPQPDTPTL VRVHVQNTLS DVFASDGPHC GWPLRAAMRQ VAEAGEGVVV VLRRRDDSDD
ILKRMRAYQV QASQADESEE ARSSSDLRTY GLGAQILTDV GVRRMRVLSA PKRMHAISGF
GMEVVEYVEP E