Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1035 |
Symbol | |
ID | 4269776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1183364 |
End bp | 1184632 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125787 |
Product | thiamine biosynthesis/tRNA modification protein ThiI |
Protein accession | YP_741878 |
Protein GI | 114320195 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.533687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.416754 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCAG CACAACCCGC GGATCAACGC CCGGTACTGG GCCGCTTCCA GCCCAATCGC CTGCTCATCC ATTACGGCGA GGTCGCCCTG AAGGGGCGGA ATCGCAGCCG TTTCGAGCGC GCGCTGCGCC ACAACCTGCG CCATCGGCTG CGGGCCGCCG GCCTGCGCCC CGAGATCCAC CAGGGCCACC AGCGCATCTG GGTGGATCTG CAAGGCCTGT CGGAGGCGCA GCGCCAGGAG GTGCTCCAGG CCGCGGCGGA GACCCCGGGG ATCGTCACCT ACCAGCCGGT GCACTACCTG GCCGCCAGCG CCGACCGCCA GGCCATGCTG GAACAGGCCC GCATGCTGCT GTCCGACCTG GCCAACCGTA TGGAGAACGC CGGCCAGGGG AGCTTTGCCG TCCGGGTCAA GCGGGCCGAC AAGCGCTTTC CCCTGCCCTC GGACCAGATC GCGGCACGGA TCGGTCAGAC CATCATCGAA CACAGCCCCT GGCAGCGGGT GGACCTCCGC CGCCCCGACC AGACCTTCCA CCTGGACATC TACAGCGACG GCATGTACTG CTACGGCACC CGTCACCGGG GCCTGGGCGG GCTGCCGGTA GGGCCCGGGG GTCGAGCGCT GACCCTGCTC TCCGGGGGCA TCGACTCGCC GGTGGCGGCC TTCCTGATGG CCAAGCGCGG CTGCCGGGTG GATTTCCTGC ATTTCACCGC CAGCCCGTCG CAGCAGCGTG CCGACGCCGA CCACACGGTG ATCCGGCTGG CGCGCCAACT GTCCCGCTAC ACCGGCCATT CGCGGCTCTA CATGGTGCCC TATGATCACT TCGACATGGC GCTGCACGGC GACCAGCGCG GCCACGAAGT GGTGTTGTTC CGCCGCTTCG TCGCTCGCAC CGGCGAGGCG CTGGCCAAGC GTTTAAGGGC GCTGGCGCTG ATTTCCGGTG ACAGCCTGGG CCAAGTGGCC TCGCAAACCA TGGAGAACAT GGTCAGCACC TCGCTGGCCG TGCGGATGCC GATCATGCGC CCGCTGGTGG GGCTGGACAA GGCGGATATC GTGGCCATTG GCCGGCGCAT CGGGACTTAC GACATCTCCA TCCAGCCCTA TAAGGATTGC TGCGCGCTGT TGGACCACCA ACCCCGCACC CGCACCCTGC CCTCCAACCT GGAGGCCATC GAAGCCCAAT TGGGGCTGGA TGACCCGGCC CTGGTGGAGG CCACGCTGGC CGACGCCGTC TGTCTGGAGT TCGCCGCCGG CCGCGACCTC AAGCGCTAA
|
Protein sequence | MTAAQPADQR PVLGRFQPNR LLIHYGEVAL KGRNRSRFER ALRHNLRHRL RAAGLRPEIH QGHQRIWVDL QGLSEAQRQE VLQAAAETPG IVTYQPVHYL AASADRQAML EQARMLLSDL ANRMENAGQG SFAVRVKRAD KRFPLPSDQI AARIGQTIIE HSPWQRVDLR RPDQTFHLDI YSDGMYCYGT RHRGLGGLPV GPGGRALTLL SGGIDSPVAA FLMAKRGCRV DFLHFTASPS QQRADADHTV IRLARQLSRY TGHSRLYMVP YDHFDMALHG DQRGHEVVLF RRFVARTGEA LAKRLRALAL ISGDSLGQVA SQTMENMVST SLAVRMPIMR PLVGLDKADI VAIGRRIGTY DISIQPYKDC CALLDHQPRT RTLPSNLEAI EAQLGLDDPA LVEATLADAV CLEFAAGRDL KR
|
| |