Gene Mlg_2063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2063 
Symbol 
ID4270449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2338968 
End bp2339984 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content73% 
IMG OID638126819 
ProductAIR synthase-like protein 
Protein accessionYP_742895 
Protein GI114321212 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.216375 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG ACCCCATGCT GGAGAGCGGC AAGCTGCCGC CGGAACAGCT CGCCCGACTG 
CTGGCGGGCC TGCCGCCCAC CGGCGCCGAT GTGGTGATGG GGCCGGGGGT CGGGCTGGAC
TGTGCCGTGG TCCGCCACGG TGGGCACCTG CTGGTCTGCA AGTCCGACCC CATCACCTTC
GTGGCCGATG ACCTGGGCCA CTACCTGGTC CAGGTGAATG CCAACGATGT CGCCACCACC
GGTGCTACGC CGCGCTGGTT GCTGGTCACG CTGTTGCTGC CGGCCGGAGG CACGCCGCGG
GCCCTCCCGG AGCAGCTCAT GGGACAGATC AGCGAGGCCT GTGAGCGGTT GGGGATCGCA
CTGATCGGTG GACACACGGA GGTGACCACG GCGGTCACCC GGCCGGTGGC GGTCGGCGCC
CTGCTGGGCG AGGTGACCGA GGCGCGGCTG GTGACGCCGC AGGGTGCCCG ACCGGGCGAT
CGGCTCCTGC TGACCAAGGG CGTGCCGTTG GAGGGGACGG CGATCCTGGC CAGCGATTGC
CGGGCGCGGC TGGCGGACCG CTTCAGCGTG GCGGAGTTGG ACGCGGCGGC GGCGTTCCTG
GAGCGGCCGG GGATCAGCGT GGTGGCGGAC GCCCGCATTG CCCTGGCGGC GGGGCGGGTC
AATGCCATGC ACGACCCCAC GGAGGGGGGC GTGAAGGCGG CACTGTGGGA GCTGGCGCAG
GCCAGCGGGC GTCGTCTGCG GGTGGAGGCG GGAGCGATCC CGGTGCCGGC ACTGTCGCGC
CGGATCTGTA GCCATCTCGG TCTGGACCCC CTGGCGACCA TCGCCTCCGG CGCGCTGCTG
TTGGCGGTGC CGGCTGCTGA GGCGCCGGCG GTGCGACAGG CACTGGACGG CTCTGACATC
CCCTGCGCCG ATATCGGCGG CGTCGAGGCG GGGCCGGCAG CGGTTATCTG GTGCACGGAG
ACCGGCTGCG GGCCGTTGGC GCCGCCCGGT CAAGACGAGC TGGCACGATT GGTCTGA
 
Protein sequence
MTDDPMLESG KLPPEQLARL LAGLPPTGAD VVMGPGVGLD CAVVRHGGHL LVCKSDPITF 
VADDLGHYLV QVNANDVATT GATPRWLLVT LLLPAGGTPR ALPEQLMGQI SEACERLGIA
LIGGHTEVTT AVTRPVAVGA LLGEVTEARL VTPQGARPGD RLLLTKGVPL EGTAILASDC
RARLADRFSV AELDAAAAFL ERPGISVVAD ARIALAAGRV NAMHDPTEGG VKAALWELAQ
ASGRRLRVEA GAIPVPALSR RICSHLGLDP LATIASGALL LAVPAAEAPA VRQALDGSDI
PCADIGGVEA GPAAVIWCTE TGCGPLAPPG QDELARLV