Gene Information |
Locus tag | Mlg_2017 |
Symbol | |
ID | 4269617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2289957 |
End bp | 2292278 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638126773 |
Product | (NiFe) hydrogenase maturation protein HypF |
Protein accession | YP_742849 |
Protein GI | 114321166 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0068] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00143] [NiFe] hydrogenase maturation protein HypF |
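
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2289957
end_bp = 2292278

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the results agree with the Gene Length (2322 bp) and Protein Length (773 aa) fields.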
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCGG TCAGCCGGCA TTCGGTGCCC AAGCGTCTGG CAGTCCGCGT ATGCGGCCGG GTCCAGGGTG TGGGCTTCCG GCCTTTCGTC AAGTGCCTTG CTGACAGCTG CAACGTTAGT GGTTGGGTCC GCAACGATGC GGGCGGCGTC AGCCTGGAGG TCCAGGGAGG CGGGTCGCAG CTTGCCCGGT TTATGGTGGC TCTGCGAGAG CAGGCACCGC CCCTGGCCCG CATCGAGGCG GTGGTAGAGA ACCCCTGCCC TTTGCAGCCC CGGGAGCGTG GGTTTCGTAT CCGGCCCAGC AGCGGCGGTG CGGTGGCAAC CGAAATCACG CCGGATGCGG CGGTTTGTGA TGCCTGTCTC GAAGAGCTGT TCGATCCGGC AGACCGGCGC TACCGTTACC CTTTTATCAA CTGCACCCAC TGCGGCCCAC GGTACACACT CACCTCCGGA CTGCCCTACG ACCGGGTCCG GACCAGCATG GCCAAATTCC CCCAATGCCC CCGGTGTCTG GCGGAGTATG AGGACGCCGG CGATCGCCGG TACCATGCCC AGCCGAATGC CTGCCCGGAG TGCGGCCCCA GTCTGCAACT ACTTGACGGT GACGGACATC CGCTGCCCGT GGGGGACGTA CTTGCGGCCA CGGTCGACCG GTTGACCAGC GGACAGATCC TCGCGATCAA AGGGTTGGGC GGCTTCCATC TCCTGTGCGA TGCCGGAAAC CCGGAAGCTG TTGCGCGTCT GCGGACCCGC AAACGGCGGT CCGAGAAGCC CTTCGCCGTC ATGGTCCCCG GCATTGAGGC AGCCGAGAGG CTGGTACGAC TGGCCCCAGG CGAGCGCCAC CTGCTGGCGG GCGTCGACCG TCCAATCCTG CTGGCTGGGA AGCGTCCCAC CGTGGATCGG CAACTGCCTG GCGTGGCGCC AGGGATGCCT TGGCTGGGCG TGATGTTGCC GTATACGCCT TTGCATTACC TGCTGTTTCA CGAAGCCGCC GGACGCCCGA TGGGATTGGC TTGGTTGTCC GGCACCGGGC AGCCGGTGTG GGTCTGTACC TCCGCCAACC CTGGCGGAGA GCCTCTGGTC ACGGATAACC AGGACGCTGT GCGCCGTCTT TCTGGCCTGG CGGATGCGCT GCTGGTGCAT GACCGGGACA TTCTGGTGCG TTGCGACGAC AGCGTGGTTC GTCAGGCAGA GGGACGGGCC GTGTACCTTC GCCGTGGCCG GGGGGTCACC CCATCGCCGA TCCGATTGCC CCGTGGCGGG CCGCCGGTCC TCGCTACCGG CGGCCATCTG AAGAACACCG TTTGCCTGAC ACGCGGTGAC CAGGCGTACA TATCCCAGCA CGTTGGGGAT CTGGACAACG CCGCTACCTG TCGAGCCTTG GACGACACCG TAGCCCATCT GCAGAGGGTG TTGGCGGTCA GGCCCGACTG CGTCGCCTGT GACCGTCATC CTGATTTTTA CAGCAGTGCC CTGGCTGCCC GCTTAGCGGC GGAGTGGGCG ATCCCCTTGG TCAGGGTGCA GCACCATCAT GCGCATCTGG CCGCAGTGCA GGCCGAGCAT GGCCTGGAAG GCCCGATGTT GGGTGCAGCA CTGGATGGCG TGGGGTTTGG GGATAACGGC GAGCCCTGGG GGGGCGAACT GCTCGCCCTT GATGGTAGGG GCGGCTTTCG CCGCCTTGGT TTCCTGCGCC CTCTGCCTTT GCCGGGGGGG GACCGGGCTG CCCGGGAACC CTGGCGCATG GCTGCGGCCG CGCTGCACGA GCTCGGTTGG GGCGAGCGGA TTCCCGCATG GTTCCCGGAA 
CATCCCCGTG CACAAGGGCT GGCCGGAATG CTGGCTACGG GCACCCGCTG CCCGCGGACC AGTAGCCTTG GACGCTTGTT CGATGCCGCG GCGGGCCTAC TCAAGGTGAA GGCGGTCAGC CGGTTCGAGG GACAGGCCGC CATGCTGCTC GAAGGTCTTG CTGCCGAGCA TGGGGCGGTT CCGGCATGGG AGGGGGGCTG GACGTTGGAT CAGGAGGGGC TCGACTTTCG TCCCTTGCTT GCGGAGCTGA CCCGGAGCTC GGAACGCGGG TTCGGAGCTG CTCTGTTCCA CGCCACGCTG GCTGCCGGCC TGGCTGACTG GTTGTGTCGG GCGGCAGAGA AAGCAGGGCA GAAACGAGTG GCGATCGCGG GAGGGTGCTG TGCCAATCAG GTGATGATGC GCGATCTTTG CATGCGACTG GAACACGCTG GCCTGAGTGT ATACCAGGCC CGACAGGCGC CGCCCAACGA CGCCGGGTTG AGTCTTGGAC AAGCCTGGGT GGCGTTACAA ACCGTGAGGT AA
|
Protein sequence | MNAVSRHSVP KRLAVRVCGR VQGVGFRPFV KCLADSCNVS GWVRNDAGGV SLEVQGGGSQ LARFMVALRE QAPPLARIEA VVENPCPLQP RERGFRIRPS SGGAVATEIT PDAAVCDACL EELFDPADRR YRYPFINCTH CGPRYTLTSG LPYDRVRTSM AKFPQCPRCL AEYEDAGDRR YHAQPNACPE CGPSLQLLDG DGHPLPVGDV LAATVDRLTS GQILAIKGLG GFHLLCDAGN PEAVARLRTR KRRSEKPFAV MVPGIEAAER LVRLAPGERH LLAGVDRPIL LAGKRPTVDR QLPGVAPGMP WLGVMLPYTP LHYLLFHEAA GRPMGLAWLS GTGQPVWVCT SANPGGEPLV TDNQDAVRRL SGLADALLVH DRDILVRCDD SVVRQAEGRA VYLRRGRGVT PSPIRLPRGG PPVLATGGHL KNTVCLTRGD QAYISQHVGD LDNAATCRAL DDTVAHLQRV LAVRPDCVAC DRHPDFYSSA LAARLAAEWA IPLVRVQHHH AHLAAVQAEH GLEGPMLGAA LDGVGFGDNG EPWGGELLAL DGRGGFRRLG FLRPLPLPGG DRAAREPWRM AAAALHELGW GERIPAWFPE HPRAQGLAGM LATGTRCPRT SSLGRLFDAA AGLLKVKAVS RFEGQAAMLL EGLAAEHGAV PAWEGGWTLD QEGLDFRPLL AELTRSSERG FGAALFHATL AAGLADWLCR AAEKAGQKRV AIAGGCCANQ VMMRDLCMRL EHAGLSVYQA RQAPPNDAGL SLGQAWVALQ TVR
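
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first three 10-base groups of the gene. This is a minimal sketch, not the database's own pipeline: `CODON_TABLE` and `translate` are hypothetical names, and the sketch assumes the gene sequence is shown in coding orientation (it begins with ATG even though the strand is listed as "-"). Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which start codons are permitted, so the standard assignments are used here.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so the space-separated 10-base groups
    used in the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
gene_prefix = "ATGAATGCGG TCAGCCGGCA TTCGGTGCCC"
protein_prefix = translate(gene_prefix)
print(protein_prefix)  # MNAVSRHSVP
```

The result matches the first ten residues of the listed protein sequence. Pasting in the full gene sequence in place of `gene_prefix` would allow the same comparison over the whole record.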