Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0611 |
Symbol | |
ID | 4268490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 661108 |
End bp | 662688 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125358 |
Product | phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_741455 |
Protein GI | 114319772 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.316467 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0000000336516 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCACAG TAAGTGACCT CCGGCCGGTG CGCCGGGCGC TCATCAGTGT TTCCGACAAG ACCGGCATCG AGCGGTTTGC CCGGGCCCTG CACGAGCAGG GGGTGGAGCT GCTCTCCACC GGCGGCACCG CCCGACTGCT GCGCGAGGCC GGCCTGCCGG TCACCGAGGT CTCCGACTAC ACCGGTTTCC CCGAGATCAT GGCCGGCCGG GTGAAAACCC TGCACCCCAA GGTCCACGGC GGCCTGCTGG GGCGGCGGGG CACCGACGAT GAGGTCATGG CGGAGCAGGG CATTCAGCCC ATCGACCTGC TTGCGGTCAA CCTCTATCCC TTCGAGCGCA CCGTGGCCGA TCCGGACTGC CGCCTTGAGG AGGCCATCGA GAACATCGAC ATCGGCGGTC CGGCCATGCT GCGCGCCGCG GCAAAGAACC ATGCCGATGT GGCGGTGGTC ACCGATCCGG CCGACCACGA GGCGTTGATC GATGAGCTCA AGCGTGAAGG CGGTCTGGGG CGGGCCACCC GCTTCAACCT GGCAGTGAAG GCGTTCGAGC ACACGGCCCG TTACGACGGG GCCATCGCCA GTTACCTGGG TGCCCGGCTG GGCGAGGGTG AACCGGCGCG GTTCCCGCGC ACCTTCAATG TGCAGTTCGA GAAGGCGCTG GATATGCGCT ACGGTGAGAA CCCGCACCAG GCGGCCGCCT TCTACCGCGA GCACGATGTT GTGGAGCCCT GTGTCGCCAC TGCCGAGCAG TATCAGGGCA AGGCGCTGTC CTACAACAAC GTGGCCGATA CCGATGCGGC GCTGGAGTGT GTCAAGGCCT TCGAGGCGCC GGCCTGTGTC ATCGTCAAGC ACGCCAACCC CTGCGGGGTG GCCGTCGGCC AGGACCTGCT GAGTGCCTAT GAGCGCGCCT TTGAGGCGGA CCCCACCTCG GCCTTCGGCG GCATCATCGC CTTCAACCGC GAGTTGGACG GCAGGACCGC GGCGGCGATC GTCGAGCGCC AGTTTGTGGA GGTGATCATC GCCCCCAGCG TCACCGCGGA GGCCCGCGAG GCCGTGGCCG CCCGCAAGAA CGTCCGCCTG CTGGCCTGCG GCCAGTGGGG GCCGGAGCGG GCCCCGGGCC TGGACTACAA GCGGGTGGGC GGCGGCCTGC TGGTGCAGGA GCGGGACATC GCCCGGGTGC CGCAGGGGGC GCTCAAGGTG GTCACCCGCA AGCAGCCGGA CGAGCAGACC TGGCAGGACC TGCTGTTTGC CTGGGCGGTG GTGCGCTACG TCAAGTCCAA CGCCATCGTA TTCGCCGCCG ACGGGCGCAG CCTGGGCATC GGCGCCGGGC AGATGAGCCG GGTGTTCAGC ACCCGTATCG CCCGCGACAA GGCTGCCGAG GCCGGACTGG AGGTCAAGGG CGCGGCCATG GCCTCCGACG CCTTCTTCCC CTTCCGCGAC GGCCTCGACC AGGCGGCGGA GGCGGGCATT GGGGCGGTGA TCCAGCCGGG TGGCTCGATG CGCGACCAAG AGGTGATCGA CGCCGCTGAC GAGCATGGGC TGGTCATGGT ATTCACGGGT ATGCGGCACT TCCGGCACTG A
|
Protein sequence | MATVSDLRPV RRALISVSDK TGIERFARAL HEQGVELLST GGTARLLREA GLPVTEVSDY TGFPEIMAGR VKTLHPKVHG GLLGRRGTDD EVMAEQGIQP IDLLAVNLYP FERTVADPDC RLEEAIENID IGGPAMLRAA AKNHADVAVV TDPADHEALI DELKREGGLG RATRFNLAVK AFEHTARYDG AIASYLGARL GEGEPARFPR TFNVQFEKAL DMRYGENPHQ AAAFYREHDV VEPCVATAEQ YQGKALSYNN VADTDAALEC VKAFEAPACV IVKHANPCGV AVGQDLLSAY ERAFEADPTS AFGGIIAFNR ELDGRTAAAI VERQFVEVII APSVTAEARE AVAARKNVRL LACGQWGPER APGLDYKRVG GGLLVQERDI ARVPQGALKV VTRKQPDEQT WQDLLFAWAV VRYVKSNAIV FAADGRSLGI GAGQMSRVFS TRIARDKAAE AGLEVKGAAM ASDAFFPFRD GLDQAAEAGI GAVIQPGGSM RDQEVIDAAD EHGLVMVFTG MRHFRH
|
| |