Gene Mlg_0275 details


Gene Information

Locus tag: Mlg_0275
Symbol:
ID: 4270493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 316674
End bp: 317750
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 65%
IMG OID: 638125000
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_741120
Protein GI: 114319437
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
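
The gene and protein lengths listed above follow from the start and end coordinates. As a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 316674
end_bp = 317750

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the last codon is assumed to be a stop
# codon and therefore does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1077 358
```

Both results agree with the Gene Length and Protein Length fields of the record.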


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.447479
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.950957
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGATC GTACTGTGGA CGACCTGAAC GTCGAGAAGA TCGAGCACCT CCCCACCCCC 
GCCGAGATCA AGGCCCAGCT GCCGCTCAGC GAGCAAGCCC GCCGTCTGGT GGTCGAGGGT
CGGGAGACCG TTCGCAACAT CCTGGATGGC AAGGATCACC GGCTGTTGGT GGTGGTGGGG
CCCTGTTCCA TCCACGACCC CAAAGCGGCG CTGGACTATG CCAAGCAACT GAAAGCGCTG
AGTGATCAGG TGGGCGACAG TCTGTTCATC GTCATGCGGG TCTATTTTGA GAAGCCGCGC
ACGGTGACCG GGTGGAAGGG GCTGATCAAC GACCCCGACA TGGACGACTC CTTCCGGATC
GACAATGGCC TGTTCCAGGC GCGCAAACTG CTGCTGGACC TGGCTGAGAT GGGACTGCCC
ACAGCCACCG AAGCGCTCGA CCCGATCATC CCGCAGTACC TGCAGGACCT GATCACCTGG
ACGGCCATTG GTGCCCGCAC CACCGAATCG CAGACCCACC GCGAGATGGC CAGCGGCCTC
TCCACGCCGG TCGGATTCAA GAATGGCACC GACGGCAGCC TGGACGTGGC CATCAACGCC
ATGAAGTCCG CCGCCCATCC GCACAGCTTC CTGGGCATCA ACTCCCGCGG CGAGTGCAGT
ATCATCCGGA CCCGCGGCAA CAGCTACGGC CACGTGGTGC TGCGCGGCGG CCATGGCCAG
CCCAATTACG ACAGCGTGCA CATTGCCCTG TGCGAGCAGG AGCTGGAAAA GGCGGGGCTG
CCCGCGCGGA TCGTGGTCGA CTGCAGCCAC GCCAACTCCA ACAAGGACCC GGCGCTGCAG
CCCATGGTGC TGAAGGACCT GGTGCACCAG ATCCTGGAGG GCAACCAGTC GCTGGTGGGC
GTCATGCTGG AGAGCAACCT GGGCTGGGGC AACCAGAAGC TGGGGGCCGA TCCCGCTGCC
CTCGACTACG GGGTCTCCAT CACCGATGCC TGTATCGACT GGCCGACCAC CGAGCAGGGT
CTGCTGGAGG CGGCGGAAAA GCTGCGCGAG GTGTTGCCCC GGCGGGCCGC GGCCTGA
 
Protein sequence
MSDRTVDDLN VEKIEHLPTP AEIKAQLPLS EQARRLVVEG RETVRNILDG KDHRLLVVVG 
PCSIHDPKAA LDYAKQLKAL SDQVGDSLFI VMRVYFEKPR TVTGWKGLIN DPDMDDSFRI
DNGLFQARKL LLDLAEMGLP TATEALDPII PQYLQDLITW TAIGARTTES QTHREMASGL
STPVGFKNGT DGSLDVAINA MKSAAHPHSF LGINSRGECS IIRTRGNSYG HVVLRGGHGQ
PNYDSVHIAL CEQELEKAGL PARIVVDCSH ANSNKDPALQ PMVLKDLVHQ ILEGNQSLVG
VMLESNLGWG NQKLGADPAA LDYGVSITDA CIDWPTTEQG LLEAAEKLRE VLPRRAAA
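
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own method: it assumes the listed gene sequence is the coding strand in frame, and it uses the standard codon assignments, which translation table 11 shares with the standard code for amino acids (table 11 differs in its permitted start codons). The helper `translate` and the name `first_line` are hypothetical; only the first 60 bp of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order (third base varies
# fastest). Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCTGATC GTACTGTGGA CGACCTGAAC GTCGAGAAGA TCGAGCACCT CCCCACCCCC"
print(translate(first_line))  # prints: MSDRTVDDLNVEKIEHLPTP
```

The output matches the first 20 residues of the listed protein sequence. Passing the full 1077 bp gene sequence to the same function should stop at the terminal TGA codon.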