Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2020 |
Symbol | |
ID | 4269620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2293568 |
End bp | 2294707 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126776 |
Product | hypothetical protein |
Protein accession | YP_742852 |
Protein GI | 114321169 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCGG GCGCGGAATT GGTGCTGCAG CTTCATCGGC GGCGCGGTCG GATCAACGAA GTCGTTCTGC AGGGACGGCG TCCACCTTGG GTGGGTGACC TACTGCGGGG TTCGGCCGCC GGCTATGTGC CGGGGCGCAT GCGGCTGCTG TTCGGTGTGT GCGGCGAGGC GCAGGCAACC GCTGCGCGCG CGGCGCTGGC GGGTGCGGGC GTGCAGCGCG GTGGGGGGGC GGGTGTCCAA GAGCACCCCG GTGTTCTGCT GGAGTGGATC CGCGAGCACC TTTGGCGGCT GGGGCTGTGT CTGCCTCGAC AGTTGCTGAA GGGCACGCCG GCAGGGCTTA GCGAAGCCAA CAGGGCCTTG CGGGGGGTAA TGTGCGCGGT GGGTGATGAC ATAAGGGGGC GCGACGCCGC CGTAAACCGG CTGGAGTCTG CCTTGTTCGA AATCGTGGGG ACCTCGGTAC TGGCCGACAG CGCCGACGTC GATAGCTGGC TGGATTGCTG CGATAGCGAA GTGGCGGCGG CGCTGCGCTG GGTGCGTGAT GAGGTGCCGC AGGGCTTTGG TGTGGCTAGT TTGCCGTCGT TGGAGCCGGC TGGTTTACGC GCCGTGGGCG ACTGCTTGCA AGGTTCACGC GACCTTGCGG TATGGCCCCT CGACCCGCGA GACGAGGCAG GCGAATCCGG CACGGCCCAG GCTGGGCTAC GGGAGACTGG ACCCTTCGCC CGCCAGCGGG CGAACCCGGC ACTCCAGGGG GTGCTCGCCC GCCATGGCCC GGGAATCCTG CCCCGCCTCT TGGCCGCGGT GCTGGAACTA TTGGTTCTGC CCGGCCGCCT GCGCAATGCC CTACGGGCAA TAGGCGTTGA ACCTGTTTCC GAGGAGGTGG GTCAGGTCAC GGGCACCGGC TGTGGCATCG TGGATACGGC CCGCGGTCTT CTGCTGCATT GGGTGCGGTT GCACAGGGGC CAGGTCGCGG ATTATGCCGT TGTCGCCCCC ACCGAATGGA ACTTTCATCC CCGGGGTGTC TTGGTGAGCG GGCTTCGTGA GCGACCAGGC GGAGGCCCGG CGGAGGGGGT CCGGCGGCTT GCGGAACTGG CAGTGCTGGC TCTGAACCCC TGTGTCGGCA GCCGGGTGGA GGTCCACTGA
|
Protein sequence | MSSGAELVLQ LHRRRGRINE VVLQGRRPPW VGDLLRGSAA GYVPGRMRLL FGVCGEAQAT AARAALAGAG VQRGGGAGVQ EHPGVLLEWI REHLWRLGLC LPRQLLKGTP AGLSEANRAL RGVMCAVGDD IRGRDAAVNR LESALFEIVG TSVLADSADV DSWLDCCDSE VAAALRWVRD EVPQGFGVAS LPSLEPAGLR AVGDCLQGSR DLAVWPLDPR DEAGESGTAQ AGLRETGPFA RQRANPALQG VLARHGPGIL PRLLAAVLEL LVLPGRLRNA LRAIGVEPVS EEVGQVTGTG CGIVDTARGL LLHWVRLHRG QVADYAVVAP TEWNFHPRGV LVSGLRERPG GGPAEGVRRL AELAVLALNP CVGSRVEVH
|
| |