Gene Mlg_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2007 
Symbol 
ID4269607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2277974 
End bp2279932 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content66% 
IMG OID638126763 
Productheat shock protein 90 
Protein accessionYP_742839 
Protein GI114321156 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0326] Molecular chaperone, HSP90 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACAG ACGCACACAA GGAAACCCTG GAGTTCCAGG CCGAGGTGCA GCAGCTCCTG 
CACCTGATGA TCCACTCGCT CTACTCCAAC AAGGACATCT TCCTCCGCGA GCTGATCTCC
AACGCCTCGG ACGCCATCGA CAAGCTCCGC TTCCAGTCCC TGCAGGATGA ATCCCTGCTC
GAGGGCGAGG GCGACCTGCG TATCCGCGTC TCCGTGGACA AGGACGCCCG CACCATCACC
GTGGCCGACA ACGGCATCGG CATGACCCGC GACGAGGTGG CGGAGAACCT GGGTACCATT
GCCCGCTCCG GCACCAAGGC CTTCCTGGAT CAGCTCACCG GCGACCAGCA AAAGGACGCC
AAGCTGATCG GCCAGTTCGG CGTGGGCTTC TACTCGGCCT TCGTGGTGGC CGAGCACGTC
ACCGTGCATA CCCGCAAGGC CGGCCTGGGC GCCGAGCACG GGGTGCGCTG GTCCTCCGAC
GGCAAGGGCG CCTACACCCT GGAGAACGAG GAAGTGGCGG AGCGCGGCAC CCGGGTGGTG
CTGACCCTGC CCGAATCCCA GAGCGAATAC CTGGACGACT GGCGGCTGAA GGGCATCATC
CGCCGCTACT CTGACCACAT CGACGTGCCC ATCCAGATGC CGGCCCAGGC CGAGGACAAG
GACGAGCCGG AAGATGAGGC CGAAAAGGCG GAAGCCGCCG AGACCTGGGA GACGGTGAAC
AACACCAACG CCCTGTGGAT GCGCCCCAAG TCGGAGATCA GCGACGACGA CTACAAGGCC
TTCTACAAGC ACGTGGCCCA CGACTTCGAC GACCCCATGG TCTGGCTGCA CAACCACGTG
GAGGGGCGCC AGTCCTACAC CTCGCTGCTC TACATTCCGA AGAACCCGCC CTTCGACCTC
TACGAGCGGG AGCCGGCCCA CGGCATCAAG CTCTACGTGC GCCGGGTCTT CATCATGGAG
GACACCGAGA AGCTGATGCC GCGCTACCTG CGCTTCGTGC GTGGCCTGGT GGACTCCGAC
GACCTGCCGC TCAACGTCTC CCGCGAGCTG CTCCAGCACA ACCCGCTCCT GGACAAGATC
CGCAGCGCCT CGGTCAAGCG CATTCTCGAC CGCCTGGAGA AGATGGCCAA GAACGAGCCG
GAGCAGTACG CCGAGTTCTA CGGCAACTTC GGCAAGGTGC TGAAGGAGGG CGTGGCCGAG
GACTTCGCCA ACCGCGAGCG CATCGCCAAG CTGCTGCGCT TCTCCACCAC CCAAGACGAG
AATGAGACGC CGGATGTCAG CCTGGACGAC TACATCGCCC GTATGAAAGA GGGCCAGGAG
GCCATCTACT ACGTCACCGC CGAGAGCTTC AACGCCGCCC GCAACAGTCC CCACCTGGAG
GTGTTCCGCA AGAAGGGGGT CGAAGTGCTG CTGCTGCCCG ACCCGGTGGA CGAGTGGGTC
ATCACCCACC TGAATGAATA CGACGGCAAG CCGCTGAAGT CGGTGGCCAA GGGCGGCCTG
GACCTGGGCG AGCTGGAGGA CCAGGCGGAG AAGAAGGCCG CGGAAGAGGC CACCGAAAGC
CACAAGGACC TGCTGGAGAA GCTCAAGGGC GCGCTGGAGG ACAAGGTGAG CGAGGTGCGG
GTCTCCACCC GCCTGACCGA CTCCCCGGCC TGCCTGGTGG TGGGCGAGTA CGACTTCGGC
ATGGGCATGC AGCGGTTGCT CAAGGCCGCC GGCCACGCTA TGCCCCAGGG CAAGCCGGCG
CTGGAGATCA ACATCGACCA CCCCATCGTT CAGCGCATGG ACACGGGGCT GGACGATGCC
CGCTTCAGCG ATTGGGCGGC GGTGCTCTAC GACCAGGCCC TGCTCACCGA GGGCGGGCAG
CTGGAGGACC CGGCCGCCTT CGTCAAGCGG GTCAACGCCC TGCTCACCGA GCAGGCCCGG
GCCGGCGAAG CAAAGAGCAA CGCCGCCCGC GGGGACTGA
 
Protein sequence
MATDAHKETL EFQAEVQQLL HLMIHSLYSN KDIFLRELIS NASDAIDKLR FQSLQDESLL 
EGEGDLRIRV SVDKDARTIT VADNGIGMTR DEVAENLGTI ARSGTKAFLD QLTGDQQKDA
KLIGQFGVGF YSAFVVAEHV TVHTRKAGLG AEHGVRWSSD GKGAYTLENE EVAERGTRVV
LTLPESQSEY LDDWRLKGII RRYSDHIDVP IQMPAQAEDK DEPEDEAEKA EAAETWETVN
NTNALWMRPK SEISDDDYKA FYKHVAHDFD DPMVWLHNHV EGRQSYTSLL YIPKNPPFDL
YEREPAHGIK LYVRRVFIME DTEKLMPRYL RFVRGLVDSD DLPLNVSREL LQHNPLLDKI
RSASVKRILD RLEKMAKNEP EQYAEFYGNF GKVLKEGVAE DFANRERIAK LLRFSTTQDE
NETPDVSLDD YIARMKEGQE AIYYVTAESF NAARNSPHLE VFRKKGVEVL LLPDPVDEWV
ITHLNEYDGK PLKSVAKGGL DLGELEDQAE KKAAEEATES HKDLLEKLKG ALEDKVSEVR
VSTRLTDSPA CLVVGEYDFG MGMQRLLKAA GHAMPQGKPA LEINIDHPIV QRMDTGLDDA
RFSDWAAVLY DQALLTEGGQ LEDPAAFVKR VNALLTEQAR AGEAKSNAAR GD