Gene Mlg_1868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1868 
Symbol 
ID4268086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2129487 
End bp2130614 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content70% 
IMG OID638126624 
Productsuccinyl-diaminopimelate desuccinylase 
Protein accessionYP_742702 
Protein GI114321019 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01246] succinyl-diaminopimelate desuccinylase, proteobacterial clade 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGA CACTGGAACT CGCCTCCGCA CTCATCGCCC GCCGCTCGGT GACGCCCATG 
GACGCCGGCT GCCAGCAGTT GCTGGCGGAG CGATTGCGGC CCCTCGGTTT TGACTGTGAG
CGGCTGGATT ACGGCGAGGT GAACAATCTG TGGGCCCGGC GGGGTCAGCA GGGGCCGGTG
TTCTGTTTCG CCGGCCATAC CGATGTGGTG CCCCCGGGGC CGGAGGCCCA ATGGCGGCAC
CCACCCTTCC AGCCGGTGGT CGAGCAGGGG CTGCTCTACG GCCGCGGCGC GGCGGACATG
AAGGGCAGCG TCGCGGCCTT TGTCACCGCC CTGGAGCGCT ACCTGGCCGG CGGCCACCGG
CCGCGGGGTT CGCTCGCCCT GCTGATCACC AGCGACGAGG AGGGCCCGGC GGTGGACGGC
ACCCGGCACG TGGTCGAGAC CCTGTCCGAG CGCGGCGAGC GCATCGACTG GTGCCTGGTG
GGTGAGCCCT CCAGCACCGA ACGCGTGGGG GATGTGGTGA AGGTGGGCCG GCGCGGGTCG
CTCAACGGGC GGCTGACGGT GCGCGGCGAC CAGGGCCACG TGGCCTATCC CCATTTGGCG
CGCAATCCGG TGCACCAGGC GCTGGCCGCC CTGGATGAGC TGGTCACCAC CCGCTGGGAC
GAGGGCAACG ACCATTTCCC GCCCACCAGC TTCCAGATCT CCAACGTCCA AGCCGGCACC
GGCGCCACCA ACGTGATCCC CGGCGAGCTG GAGGTGACGT TCAATTTCCG CTTCTCCACC
GAGGTAACGG CGGATGAGTT ACAGCAGCGG GTAGAGGCGG TGCTGGACCG TCACGGCCTG
GACGGGCGGA TCGACTGGTC GCTCTCGGGC GAGCCCTTTC TGACCGCGGA GGGGGAGCTG
GTGGCCGCCA CCCAGGCGGC GGTCCGCGAT GTCTGCGGCG ACCCACCGGT GCTCTCCACT
TCCGGCGGCA CCTCAGACGG CCGCTTCATC GCCCCCACCG GGGCCCAGGT CCTGGAGCTG
GGGCCTGTGA ACGCCACCAT CCACAAGGTG AACGAGCACG TGCGGGCGGC GGATCTGGAC
ACGCTGTCAA GGATTTACGA GGGCGTCCTG CGCCGACTGC TCGGCTGA
 
Protein sequence
MSETLELASA LIARRSVTPM DAGCQQLLAE RLRPLGFDCE RLDYGEVNNL WARRGQQGPV 
FCFAGHTDVV PPGPEAQWRH PPFQPVVEQG LLYGRGAADM KGSVAAFVTA LERYLAGGHR
PRGSLALLIT SDEEGPAVDG TRHVVETLSE RGERIDWCLV GEPSSTERVG DVVKVGRRGS
LNGRLTVRGD QGHVAYPHLA RNPVHQALAA LDELVTTRWD EGNDHFPPTS FQISNVQAGT
GATNVIPGEL EVTFNFRFST EVTADELQQR VEAVLDRHGL DGRIDWSLSG EPFLTAEGEL
VAATQAAVRD VCGDPPVLST SGGTSDGRFI APTGAQVLEL GPVNATIHKV NEHVRAADLD
TLSRIYEGVL RRLLG