Gene Mlg_2017 details

Gene Information

Locus tag: Mlg_2017
Symbol:
ID: 4269617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2289957
End bp: 2292278
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 66%
IMG OID: 638126773
Product: (NiFe) hydrogenase maturation protein HypF
Protein accession: YP_742849
Protein GI: 114321166
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0068] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00143] [NiFe] hydrogenase maturation protein HypF
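
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
gene_len = gene_length_bp(2289957, 2292278)
print(gene_len, protein_length_aa(gene_len))  # 2322 773
```

Under those assumptions, the computed values agree with the Gene Length and Protein Length fields of the record.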


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCGG TCAGCCGGCA TTCGGTGCCC AAGCGTCTGG CAGTCCGCGT ATGCGGCCGG 
GTCCAGGGTG TGGGCTTCCG GCCTTTCGTC AAGTGCCTTG CTGACAGCTG CAACGTTAGT
GGTTGGGTCC GCAACGATGC GGGCGGCGTC AGCCTGGAGG TCCAGGGAGG CGGGTCGCAG
CTTGCCCGGT TTATGGTGGC TCTGCGAGAG CAGGCACCGC CCCTGGCCCG CATCGAGGCG
GTGGTAGAGA ACCCCTGCCC TTTGCAGCCC CGGGAGCGTG GGTTTCGTAT CCGGCCCAGC
AGCGGCGGTG CGGTGGCAAC CGAAATCACG CCGGATGCGG CGGTTTGTGA TGCCTGTCTC
GAAGAGCTGT TCGATCCGGC AGACCGGCGC TACCGTTACC CTTTTATCAA CTGCACCCAC
TGCGGCCCAC GGTACACACT CACCTCCGGA CTGCCCTACG ACCGGGTCCG GACCAGCATG
GCCAAATTCC CCCAATGCCC CCGGTGTCTG GCGGAGTATG AGGACGCCGG CGATCGCCGG
TACCATGCCC AGCCGAATGC CTGCCCGGAG TGCGGCCCCA GTCTGCAACT ACTTGACGGT
GACGGACATC CGCTGCCCGT GGGGGACGTA CTTGCGGCCA CGGTCGACCG GTTGACCAGC
GGACAGATCC TCGCGATCAA AGGGTTGGGC GGCTTCCATC TCCTGTGCGA TGCCGGAAAC
CCGGAAGCTG TTGCGCGTCT GCGGACCCGC AAACGGCGGT CCGAGAAGCC CTTCGCCGTC
ATGGTCCCCG GCATTGAGGC AGCCGAGAGG CTGGTACGAC TGGCCCCAGG CGAGCGCCAC
CTGCTGGCGG GCGTCGACCG TCCAATCCTG CTGGCTGGGA AGCGTCCCAC CGTGGATCGG
CAACTGCCTG GCGTGGCGCC AGGGATGCCT TGGCTGGGCG TGATGTTGCC GTATACGCCT
TTGCATTACC TGCTGTTTCA CGAAGCCGCC GGACGCCCGA TGGGATTGGC TTGGTTGTCC
GGCACCGGGC AGCCGGTGTG GGTCTGTACC TCCGCCAACC CTGGCGGAGA GCCTCTGGTC
ACGGATAACC AGGACGCTGT GCGCCGTCTT TCTGGCCTGG CGGATGCGCT GCTGGTGCAT
GACCGGGACA TTCTGGTGCG TTGCGACGAC AGCGTGGTTC GTCAGGCAGA GGGACGGGCC
GTGTACCTTC GCCGTGGCCG GGGGGTCACC CCATCGCCGA TCCGATTGCC CCGTGGCGGG
CCGCCGGTCC TCGCTACCGG CGGCCATCTG AAGAACACCG TTTGCCTGAC ACGCGGTGAC
CAGGCGTACA TATCCCAGCA CGTTGGGGAT CTGGACAACG CCGCTACCTG TCGAGCCTTG
GACGACACCG TAGCCCATCT GCAGAGGGTG TTGGCGGTCA GGCCCGACTG CGTCGCCTGT
GACCGTCATC CTGATTTTTA CAGCAGTGCC CTGGCTGCCC GCTTAGCGGC GGAGTGGGCG
ATCCCCTTGG TCAGGGTGCA GCACCATCAT GCGCATCTGG CCGCAGTGCA GGCCGAGCAT
GGCCTGGAAG GCCCGATGTT GGGTGCAGCA CTGGATGGCG TGGGGTTTGG GGATAACGGC
GAGCCCTGGG GGGGCGAACT GCTCGCCCTT GATGGTAGGG GCGGCTTTCG CCGCCTTGGT
TTCCTGCGCC CTCTGCCTTT GCCGGGGGGG GACCGGGCTG CCCGGGAACC CTGGCGCATG
GCTGCGGCCG CGCTGCACGA GCTCGGTTGG GGCGAGCGGA TTCCCGCATG GTTCCCGGAA
CATCCCCGTG CACAAGGGCT GGCCGGAATG CTGGCTACGG GCACCCGCTG CCCGCGGACC
AGTAGCCTTG GACGCTTGTT CGATGCCGCG GCGGGCCTAC TCAAGGTGAA GGCGGTCAGC
CGGTTCGAGG GACAGGCCGC CATGCTGCTC GAAGGTCTTG CTGCCGAGCA TGGGGCGGTT
CCGGCATGGG AGGGGGGCTG GACGTTGGAT CAGGAGGGGC TCGACTTTCG TCCCTTGCTT
GCGGAGCTGA CCCGGAGCTC GGAACGCGGG TTCGGAGCTG CTCTGTTCCA CGCCACGCTG
GCTGCCGGCC TGGCTGACTG GTTGTGTCGG GCGGCAGAGA AAGCAGGGCA GAAACGAGTG
GCGATCGCGG GAGGGTGCTG TGCCAATCAG GTGATGATGC GCGATCTTTG CATGCGACTG
GAACACGCTG GCCTGAGTGT ATACCAGGCC CGACAGGCGC CGCCCAACGA CGCCGGGTTG
AGTCTTGGAC AAGCCTGGGT GGCGTTACAA ACCGTGAGGT AA
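
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the fraction of G and C bases over the whole sequence and that the blanks and line breaks in the listing are layout only; `gc_content` is a hypothetical helper name.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for layout."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGAATGCGG TCAGCCGGCA TTCGGTGCCC AAGCGTCTGG CAGTCCGCGT ATGCGGCCGG"
print(f"{gc_content(first_row):.1%}")
```

Passing the complete gene sequence instead of the first row gives the value to compare with the GC content field.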
 
Protein sequence
MNAVSRHSVP KRLAVRVCGR VQGVGFRPFV KCLADSCNVS GWVRNDAGGV SLEVQGGGSQ 
LARFMVALRE QAPPLARIEA VVENPCPLQP RERGFRIRPS SGGAVATEIT PDAAVCDACL
EELFDPADRR YRYPFINCTH CGPRYTLTSG LPYDRVRTSM AKFPQCPRCL AEYEDAGDRR
YHAQPNACPE CGPSLQLLDG DGHPLPVGDV LAATVDRLTS GQILAIKGLG GFHLLCDAGN
PEAVARLRTR KRRSEKPFAV MVPGIEAAER LVRLAPGERH LLAGVDRPIL LAGKRPTVDR
QLPGVAPGMP WLGVMLPYTP LHYLLFHEAA GRPMGLAWLS GTGQPVWVCT SANPGGEPLV
TDNQDAVRRL SGLADALLVH DRDILVRCDD SVVRQAEGRA VYLRRGRGVT PSPIRLPRGG
PPVLATGGHL KNTVCLTRGD QAYISQHVGD LDNAATCRAL DDTVAHLQRV LAVRPDCVAC
DRHPDFYSSA LAARLAAEWA IPLVRVQHHH AHLAAVQAEH GLEGPMLGAA LDGVGFGDNG
EPWGGELLAL DGRGGFRRLG FLRPLPLPGG DRAAREPWRM AAAALHELGW GERIPAWFPE
HPRAQGLAGM LATGTRCPRT SSLGRLFDAA AGLLKVKAVS RFEGQAAMLL EGLAAEHGAV
PAWEGGWTLD QEGLDFRPLL AELTRSSERG FGAALFHATL AAGLADWLCR AAEKAGQKRV
AIAGGCCANQ VMMRDLCMRL EHAGLSVYQA RQAPPNDAGL SLGQAWVALQ TVR
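
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below assumes the standard codon assignments, which translation table 11 shares with the standard code; table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG. The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAATGCGG TCAGCCGGCA TTCGGTGCCC"))  # MNAVSRHSVP
```

The first ten residues obtained this way match the start of the listed protein sequence, and the final bases of the gene (ACC GTG AGG TAA) give TVR followed by a stop codon, matching its end.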