Gene Mlg_1381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1381 
Symbol 
ID4269573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1581642 
End bp1583528 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content70% 
IMG OID638126137 
Product1-deoxy-D-xylulose-5-phosphate synthase 
Protein accessionYP_742220 
Protein GI114320537 
COG category[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1154] Deoxyxylulose-5-phosphate synthase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.604583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.510709 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTTT CCATAGACGA CTACCCGCTG CTGGCCCAGG TACCCGACCC TGAGGCCCTG 
CGCACCCTGC CGGCCCACCG CCTGCCGGCG CTGGCCAAAG AGTTGCGCCA CTACCTGCTG
CACAGCGTGG CCCGTAGCGG CGGGCACTTG GCGGCCGGAC TCGGTGCCGT GGAGCTGACC
CTGGCCCTGC ACTACGTCTT CAATACCCCC GAGGACCGTC TGGTCTGGGA TGTGGGCCAC
CAGTGCTACC CGCACAAGAT CCTCACCGGG CGCCGCGAGC GCCTGGGCAC CATCCGCAAA
TACGGCGGCC TGGCGGGTTT CCCCAAGCGC GCCGAGAGCC CCTACGACAC CTTCGGGGTC
GGCCACTCCA GCACCTCGAT CAGCGCCGCC CTGGGCATGG CGCTGGCCGC GCGGCAGGCC
GGGGAGCAGC GCAAGGCGGT GGCCATCATC GGTGACGGCG GGATGACCGC CGGCGAGGCC
TTCGAGGCGA TGAACCACGC CGGCGATGCG GGCGCGGACC TGCTAGTGGT GCTCAACGAC
AACGAGATGT CCATTTCCGA GAACGTGGGC GCCCTCTCCC AGCACCTGAC TCGGATCCTG
TCGGGCCGCT GGTACCACCA GTTGCGCTCC GGCAGTAAGG AGGTGCTGCG CCGCCTGCCG
CCGCCGGTCC ATGAACTGGC CCGGCGCACC GAGGAGCACC TCAAAGGGAT GGTGGTCCCC
GGCACGCTGT TCGAGGAGTT AGGGTTTCAG TATTTCGGGC CGGTGGACGG CCACAACGTG
GACGCCCTGG TGGAGGTGCT CGGCAACCTG GCCCATCAAC GGGGGCCGCG CCTGTTGCAT
GTGGTCACCT GCAAGGGCAA GGGTTACCGG CCGGCGGAGC AGGACCCCAT CGCCTACCAC
GGGGTCGGGC AGTTCGATCC CGAGCAGGGG CTGCCGAAGA AATCCGGGGG CTCGCTGGCC
TACAACCAGG TCTTCGGACG TTGGCTCTGC GCCATGGCGG AACAGGACCC GCGGCTGGTG
GCCATCACCC CCGCCATGCG CGAGGGCTCC GGCATGGTCG AGTACGCCCG GCGCTTTCCC
GAGCGCTACC ACGACGTGGG CATCGCCGAG CAGCACGCGG TCACGCTGGC GGCAGGCCTG
GCCTGCGAGT CGGTGAAACC GGTGTTGGCG ATCTACTCCA CCTTCCTGCA GCGGGGTTAC
GACCAGTTGG TCCACGACGT GGCCCTGCAG AACCTGCCGG TGCTGTTCGC GGTGGACCGG
GCCGGCCTGG TGGGCGCCGA CGGCCCCACC CACCACGGCA GCTTCGACCT CTCTTACCTG
CGCTGCGTGC CCAATATGAC CATTGCCGCC CCCTCCGACG AGGCGGAGTG CTGGCGGTTG
CTGAGCACCG GCTACCACCA CGACGGTCCC TTCGCGGTGC GCTACCCCCG CGGGTCCGGC
CCCGGCGCGG CACTCCCGGA AGCGGACCTG GACCCGCTGG CCATTGGCAA GGGTGTTTGT
CGTCGCCGCG GGCGGCGGAT CGCGGTGCTG GCCTTCGGCA CCCTCGTGGT ACCGGCATTG
GCGGTGGCCG AGGCCCTGGA CCTCACCGTG GCCGATATGC GCTTCGTACG GCCCCTTGAC
GAGGCCCTGA TCCGTGAGCT GGCGGACACC CACGACCTGC TGGTGACCGT GGAGGAGAAC
GCGGTGGCGG GTGGTGCCGG CAGCGGGGTG AGCGAGTACC TGGCCCGGGC CGGTCTGGAC
GTCCCGGTGC GCCACCTCGG CCTGCCGGAC CGTTTCGTTG ACCACGGCAC TCCGGCGGAA
CTGCTCGCCG AGGTGGGCCT GGACGAGGCC GGCCTTCAGC GCAGTCTACA GGGGTGGCTC
GACACCCTCC CGGGCGCGTC ACGCTGA
 
Protein sequence
MSVSIDDYPL LAQVPDPEAL RTLPAHRLPA LAKELRHYLL HSVARSGGHL AAGLGAVELT 
LALHYVFNTP EDRLVWDVGH QCYPHKILTG RRERLGTIRK YGGLAGFPKR AESPYDTFGV
GHSSTSISAA LGMALAARQA GEQRKAVAII GDGGMTAGEA FEAMNHAGDA GADLLVVLND
NEMSISENVG ALSQHLTRIL SGRWYHQLRS GSKEVLRRLP PPVHELARRT EEHLKGMVVP
GTLFEELGFQ YFGPVDGHNV DALVEVLGNL AHQRGPRLLH VVTCKGKGYR PAEQDPIAYH
GVGQFDPEQG LPKKSGGSLA YNQVFGRWLC AMAEQDPRLV AITPAMREGS GMVEYARRFP
ERYHDVGIAE QHAVTLAAGL ACESVKPVLA IYSTFLQRGY DQLVHDVALQ NLPVLFAVDR
AGLVGADGPT HHGSFDLSYL RCVPNMTIAA PSDEAECWRL LSTGYHHDGP FAVRYPRGSG
PGAALPEADL DPLAIGKGVC RRRGRRIAVL AFGTLVVPAL AVAEALDLTV ADMRFVRPLD
EALIRELADT HDLLVTVEEN AVAGGAGSGV SEYLARAGLD VPVRHLGLPD RFVDHGTPAE
LLAEVGLDEA GLQRSLQGWL DTLPGASR