Gene Mlg_0611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0611 
Symbol 
ID4268490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp661108 
End bp662688 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content69% 
IMG OID638125358 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_741455 
Protein GI114319772 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.316467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0000000336516 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCACAG TAAGTGACCT CCGGCCGGTG CGCCGGGCGC TCATCAGTGT TTCCGACAAG 
ACCGGCATCG AGCGGTTTGC CCGGGCCCTG CACGAGCAGG GGGTGGAGCT GCTCTCCACC
GGCGGCACCG CCCGACTGCT GCGCGAGGCC GGCCTGCCGG TCACCGAGGT CTCCGACTAC
ACCGGTTTCC CCGAGATCAT GGCCGGCCGG GTGAAAACCC TGCACCCCAA GGTCCACGGC
GGCCTGCTGG GGCGGCGGGG CACCGACGAT GAGGTCATGG CGGAGCAGGG CATTCAGCCC
ATCGACCTGC TTGCGGTCAA CCTCTATCCC TTCGAGCGCA CCGTGGCCGA TCCGGACTGC
CGCCTTGAGG AGGCCATCGA GAACATCGAC ATCGGCGGTC CGGCCATGCT GCGCGCCGCG
GCAAAGAACC ATGCCGATGT GGCGGTGGTC ACCGATCCGG CCGACCACGA GGCGTTGATC
GATGAGCTCA AGCGTGAAGG CGGTCTGGGG CGGGCCACCC GCTTCAACCT GGCAGTGAAG
GCGTTCGAGC ACACGGCCCG TTACGACGGG GCCATCGCCA GTTACCTGGG TGCCCGGCTG
GGCGAGGGTG AACCGGCGCG GTTCCCGCGC ACCTTCAATG TGCAGTTCGA GAAGGCGCTG
GATATGCGCT ACGGTGAGAA CCCGCACCAG GCGGCCGCCT TCTACCGCGA GCACGATGTT
GTGGAGCCCT GTGTCGCCAC TGCCGAGCAG TATCAGGGCA AGGCGCTGTC CTACAACAAC
GTGGCCGATA CCGATGCGGC GCTGGAGTGT GTCAAGGCCT TCGAGGCGCC GGCCTGTGTC
ATCGTCAAGC ACGCCAACCC CTGCGGGGTG GCCGTCGGCC AGGACCTGCT GAGTGCCTAT
GAGCGCGCCT TTGAGGCGGA CCCCACCTCG GCCTTCGGCG GCATCATCGC CTTCAACCGC
GAGTTGGACG GCAGGACCGC GGCGGCGATC GTCGAGCGCC AGTTTGTGGA GGTGATCATC
GCCCCCAGCG TCACCGCGGA GGCCCGCGAG GCCGTGGCCG CCCGCAAGAA CGTCCGCCTG
CTGGCCTGCG GCCAGTGGGG GCCGGAGCGG GCCCCGGGCC TGGACTACAA GCGGGTGGGC
GGCGGCCTGC TGGTGCAGGA GCGGGACATC GCCCGGGTGC CGCAGGGGGC GCTCAAGGTG
GTCACCCGCA AGCAGCCGGA CGAGCAGACC TGGCAGGACC TGCTGTTTGC CTGGGCGGTG
GTGCGCTACG TCAAGTCCAA CGCCATCGTA TTCGCCGCCG ACGGGCGCAG CCTGGGCATC
GGCGCCGGGC AGATGAGCCG GGTGTTCAGC ACCCGTATCG CCCGCGACAA GGCTGCCGAG
GCCGGACTGG AGGTCAAGGG CGCGGCCATG GCCTCCGACG CCTTCTTCCC CTTCCGCGAC
GGCCTCGACC AGGCGGCGGA GGCGGGCATT GGGGCGGTGA TCCAGCCGGG TGGCTCGATG
CGCGACCAAG AGGTGATCGA CGCCGCTGAC GAGCATGGGC TGGTCATGGT ATTCACGGGT
ATGCGGCACT TCCGGCACTG A
 
Protein sequence
MATVSDLRPV RRALISVSDK TGIERFARAL HEQGVELLST GGTARLLREA GLPVTEVSDY 
TGFPEIMAGR VKTLHPKVHG GLLGRRGTDD EVMAEQGIQP IDLLAVNLYP FERTVADPDC
RLEEAIENID IGGPAMLRAA AKNHADVAVV TDPADHEALI DELKREGGLG RATRFNLAVK
AFEHTARYDG AIASYLGARL GEGEPARFPR TFNVQFEKAL DMRYGENPHQ AAAFYREHDV
VEPCVATAEQ YQGKALSYNN VADTDAALEC VKAFEAPACV IVKHANPCGV AVGQDLLSAY
ERAFEADPTS AFGGIIAFNR ELDGRTAAAI VERQFVEVII APSVTAEARE AVAARKNVRL
LACGQWGPER APGLDYKRVG GGLLVQERDI ARVPQGALKV VTRKQPDEQT WQDLLFAWAV
VRYVKSNAIV FAADGRSLGI GAGQMSRVFS TRIARDKAAE AGLEVKGAAM ASDAFFPFRD
GLDQAAEAGI GAVIQPGGSM RDQEVIDAAD EHGLVMVFTG MRHFRH