Gene Mlg_1899 details

Gene Information

Locus tag: Mlg_1899
Symbol:
ID: 4270099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2164868
End bp: 2166019
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 68%
IMG OID: 638126655
Product: chaperone protein DnaJ
Protein accession: YP_742733
Protein GI: 114321050
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain
TIGRFAM ID: [TIGR02349] chaperone protein DnaJ
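
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2164868, 2166019)
print(length, protein_length_aa(length))  # prints: 1152 383
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.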


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.61643
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.626034
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAAA GCGATTACTA CGAAGCCCTG GGCGTTGCCC GCAACGCCTC GGATTCAGAG 
ATCAAAAAGG CCTACCGCCG CATGGCCATG AAGTATCACC CGGACCGCAA TCCGGGTGAC
AAGGAGGCGG AGGCCCGCTT CAAGGAGGCC AAAGAGGCCT ACGAGATTTT GTCCGACCCG
CAGAAGCGGG CCGCCTACGA CCAGTTCGGT CACGCCGGTG TGGACCCCTC CGCCGGCATG
GGGGGCGCCG GGGGGCCGGG TGGCCCGGGC GGGCCGGATT TCGCCGATAT CTTCTCCGAC
GTCTTCGGCG ACATCTTCGG GGGCGGCGGT CGCCGTGGCG GCGGGGGCCG GCGCGTCTTC
CGTGGCGCGG ACCTGCGCTA CAACCTGGAG CTGTCGCTGG AGGACGCGGT GCGCGGCACC
GAGGTGCAGA TCCGGGTGCC CACCCAGGAG GTCTGTGACG CCTGTGACGG CAAGGGCACC
AAGGAGGGCA GCCAGCCCGA GACCTGCCCC ACCTGTAAGG GCCACGGTGA TGTCCGTATC
CAGCAGGGCT TCTTCTCCGT GCAGCAGACC TGCCCGCGCT GTGGCGGCAG TGGTTCGGTT
ATCACCGACC CGTGCCGTAA GTGTGGCGGG CGCGGGCGAG TGCAGTCGCA GAAAACGCTC
TCCGTACGGG TGCCGGCCGG GGTGGATACC GGCGACCGGA TCCGCCTGTC CGGCGAGGGC
GAGCCCGGTG AGAATGGCGG TCCGCCCGGC GATCTGTACG TGCAGATCAT GGTGCGTGAG
CACGAGTTCT TCCAGCGCGA CGGGGCCAAT CTGCGCTGCG AGGTGCCCAT CAGTATCACC
AAGGCCGCCC TCGGCGGTGA GGTGGAGGTG CCGACCCTGG ACGGGCGCGT CAACCTGCGC
ATCCCCGCCG GCGCCCAGTC GGGCAAGGTC TTCCGGGTGC GCGGCAAGGG GGTGAAGCCG
GTGCGCGGCG GTCCGCAGGG CGACTTGCTC TGCCGGGTGC ACGTGGAGAC CCCGGTCAAC
CTCACCAAAA AGCAGAAGGA GCTGCTGGAG GAGTTTGGCC GCACCATGGA TGACACCGGC
GACAAGCACA CCCCGCGGAC CAGCTCCTGG CTGGACAAGG CGCGCAAATT CTTCGAGGAC
TGGAAGCTCT GA
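
The GC content field of the record could be recomputed from a gene sequence laid out as above. This is a rough sketch, assuming GC content is defined as the fraction of G and C bases over all bases; the helper name `gc_content` is hypothetical, and whitespace is stripped so the 10-base block layout can be pasted in directly.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Usage: pass the full gene sequence, blocks and line breaks included.
first_row = "ATGTCCAAAA GCGATTACTA CGAAGCCCTG GGCGTTGCCC GCAACGCCTC GGATTCAGAG"
print(f"{gc_content(first_row):.1%}")
```

The example covers only the first row of the sequence, so its result is not expected to match the whole-gene value reported in the record.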
 
Protein sequence
MSKSDYYEAL GVARNASDSE IKKAYRRMAM KYHPDRNPGD KEAEARFKEA KEAYEILSDP 
QKRAAYDQFG HAGVDPSAGM GGAGGPGGPG GPDFADIFSD VFGDIFGGGG RRGGGGRRVF
RGADLRYNLE LSLEDAVRGT EVQIRVPTQE VCDACDGKGT KEGSQPETCP TCKGHGDVRI
QQGFFSVQQT CPRCGGSGSV ITDPCRKCGG RGRVQSQKTL SVRVPAGVDT GDRIRLSGEG
EPGENGGPPG DLYVQIMVRE HEFFQRDGAN LRCEVPISIT KAALGGEVEV PTLDGRVNLR
IPAGAQSGKV FRVRGKGVKP VRGGPQGDLL CRVHVETPVN LTKKQKELLE EFGRTMDDTG
DKHTPRTSSW LDKARKFFED WKL
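
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions: it uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical. Ambiguous bases such as N are not handled and would raise a `KeyError`.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the matching residues.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


print(translate("ATGTCCAAAA GCGATTACTA CGAAGCCCTG"))  # MSKSDYYEAL
print(translate("TGGAAGCTCT GA"))  # WKL
```

The two calls translate the first 30 bases and the last 12 bases of the gene sequence; the results match the first and last residues of the protein sequence shown above.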