Gene Mlg_2762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2762 
Symbol 
ID4269123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3135684 
End bp3136778 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content71% 
IMG OID638127524 
Product3-dehydroquinate synthase 
Protein accessionYP_743592 
Protein GI114321909 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.998331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGA CCACCACCCT GCAGGTGGAT CTGGGCAGCC GCAGCTACCC GATCCACATC 
GGCACCGGCC TGCTCGACCA GCCCCGGCTG ATCACCGACC ACCTGCCCGC CCGGCGGGTG
ATGGTGGTGA CCAACGAAAC CGTCGCCCCC CATTACCTCG ACCGGGTGTT GGCCAGCCTC
GATGGCCGCG AGGCCCACAG CGTGGTGCTC CCCGACGGCG AGCGCTTCAA GAGCCTGGAG
ACCGCCATGC GCGTCTATGA CGCGCTGATC GAGCACCGCT TCGACCGCGG CTCCGCCATC
GTCGCCCTGG GCGGCGGCGT GATCGGCGAT CTCGCCGGCT TCGTGGCCGC CACCTGGCAG
CGGGGGGTGC GCTACCTCCA GGTGCCCACC ACCCTGTTGG CGCAGGTGGA CTCCTCAGTG
GGCGGCAAGA CCGCGGTCAA CCACCCGGGC GGCAAGAACA TGATCGGCGC CTTTCACCAG
CCCCGCTGCG TGATCGCCGA CATGGACACC CTGGACACCC TCCCCGACCG GGAGCTGCGC
GCCGGGCTGG CCGAGGTGAT CAAGTACGGG CTGATCCGCG ACGCCGACTT TCTGACCTGG
CTGGAGGCGA ACATGCCGGC CCTGCTGGCT CGCGACAAGG CGGCGCTGGC CGAGGCGGTG
GAGCGCTCCT GCCGGCACAA GGCCGCGGTG GTGGCCGAGG ACGAGCTGGA GGCGGGGCAG
CGGGCCCTGC TCAACCTGGG CCATACCTTC GGCCACGCCA TCGAGACCCA TACCGGCTAT
GGGGCCTGGC TGCACGGTGA GGCGGTGGCC GCCGGGATGG TGATGGCCGG CTGGATGTCC
ATGCGCCAGG GCTGGCTGAG CGAGGCCGAT TTCCACCGCC TGGAGGCCAT CCTCTCCACC
GCCGGCCTGC CCGTGGCCCC GCCGGCGATG GAGAGTGAGC GCTTCCGCGA ATTGATGGCC
GTGGACAAGA AGGTGCAGGA CGGCCGGCTG CGGCTGGTGC TGTTGCGGGC GCTGGGCGAC
GCCGTGGTCA CCGATGCCTT CGACCCCGCC GCGCTGGAGG CCACCCTGGA CCACTACTGC
GCCAGCCCCG AATAG
 
Protein sequence
MTETTTLQVD LGSRSYPIHI GTGLLDQPRL ITDHLPARRV MVVTNETVAP HYLDRVLASL 
DGREAHSVVL PDGERFKSLE TAMRVYDALI EHRFDRGSAI VALGGGVIGD LAGFVAATWQ
RGVRYLQVPT TLLAQVDSSV GGKTAVNHPG GKNMIGAFHQ PRCVIADMDT LDTLPDRELR
AGLAEVIKYG LIRDADFLTW LEANMPALLA RDKAALAEAV ERSCRHKAAV VAEDELEAGQ
RALLNLGHTF GHAIETHTGY GAWLHGEAVA AGMVMAGWMS MRQGWLSEAD FHRLEAILST
AGLPVAPPAM ESERFRELMA VDKKVQDGRL RLVLLRALGD AVVTDAFDPA ALEATLDHYC
ASPE