Gene Mlg_2752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2752 
Symbol 
ID4270221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3123387 
End bp3124685 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content63% 
IMG OID638127514 
Productcitrate synthase 
Protein accessionYP_743582 
Protein GI114321899 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01798] citrate synthase I (hexameric type) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.816634 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGA AGACCGTCAC CCTGACCGAC AACAGCACCG GCAAATCCGT GGAGTTGCCG 
GTCTATCAAG GCACCCACGG CCCCGAGGTC ATCGACATCA AGAATGTCTA TGGCGAGCTG
GGCTACTTCA CTTACGACGC TGGCTTCACC TCCACCGCCA GCTGCAAAAG CGATGTCACC
TTCATCGATG GTGACAACGG CGTGCTGCTG TATCGCGGCT ACCCCATCGA ACACCTGGCT
GAGAAGAGCT CCTTCCTGGA GGTCTCCTAC CTACTGCTGC ACGGCGAATT GCCGAACAAG
GCCGAACTGG ACCAGTTCGT CAGCTCGGTG ACCAACCACA CCATGCTCAA CGAAAGCCTG
AAGGACTTCT TCGACGGCTT TCATTACAAC GCCCACCCCA TGGCCATGCT CACCGGGGTG
GTCGGGTCGC TATCCGCCTT CTACCACGGC GAACTGGACA TCAACGACCC GAAGAACCGG
GAGCTGACCG CGCACCGGGT CATCGCCAAG ATGCCGACCA TCGCCGCGGC GGCCTACAAA
CACCTGGTGG GCGAGCCCTT CGTCTACCCG CAGAACCACC TGTCCTACGC GGGGAACCTG
CTGAACATGC TGTTCTCCCG CCCCACCGAG AAGTACGAGG TTAACCCCGT GGCCGAGCGG
GCGCTGGACC AGCTCCTGAT CCTGCACGCT GACCACGAGC AGAACGCCTC CACCTCCACG
GTGCGCCTGG CCGGTTCCAC CGGCACCAAC CCCTTCGCCG CCATCGCAGC TGGTTGCGCC
GCGCTGTGGG GACCGGCGCA TGGGGGGGCC AACGAGGCGG TGCTGAACAT GCTCAACGAG
ATCGGCGACG TCTCCAACGT GCCCAAGTTC ATCGAAAAGG CGAAGGACAA GAACGACCCC
TTCCGCCTGA TGGGCTTCGG TCACCGGGTC TACAAGAACT TCGACCCGCG GGCCACCATC
ATCCGCAAGA CCTGTCACGA GGTCCTGGAG GAACTCGGCG TGGGCAAGGA CCCGCAGCTG
GAGCTGGCCA TGGAGCTGGA GGATATCGCC CTGCAGGACG AGTACTTCGT CGAGCGCAAG
CTCTACCCGA ACGTCGACTT CTACTCGGGC ATCATCTACC GCGCGCTGGG CATCCCCACC
GAGTTCTTCA CGGTGCTGTT TGCCCTGGGC CGCACCCCGG GCTGGCTGGC GCAGTGGATG
GAGATGGTCA ACGACCCCGA GCAGCGCATC GGGCGTCCGC GCCAGCTCTA CACCGGCGCC
GCCAAGCGCG ACTACGTGCC GGTGGATCAG CGCAGCTGA
 
Protein sequence
MSEKTVTLTD NSTGKSVELP VYQGTHGPEV IDIKNVYGEL GYFTYDAGFT STASCKSDVT 
FIDGDNGVLL YRGYPIEHLA EKSSFLEVSY LLLHGELPNK AELDQFVSSV TNHTMLNESL
KDFFDGFHYN AHPMAMLTGV VGSLSAFYHG ELDINDPKNR ELTAHRVIAK MPTIAAAAYK
HLVGEPFVYP QNHLSYAGNL LNMLFSRPTE KYEVNPVAER ALDQLLILHA DHEQNASTST
VRLAGSTGTN PFAAIAAGCA ALWGPAHGGA NEAVLNMLNE IGDVSNVPKF IEKAKDKNDP
FRLMGFGHRV YKNFDPRATI IRKTCHEVLE ELGVGKDPQL ELAMELEDIA LQDEYFVERK
LYPNVDFYSG IIYRALGIPT EFFTVLFALG RTPGWLAQWM EMVNDPEQRI GRPRQLYTGA
AKRDYVPVDQ RS