Gene Nther_0450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0450 
Symbol 
ID6315275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp473698 
End bp475344 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content41% 
IMG OID642642834 
Productchaperonin GroEL 
Protein accessionYP_001916634 
Protein GI188585089 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000441199 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones95 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGG ACATTAAATT TAGAGAAGAT GCTCGTGCCA GATTAGAGCA AGGTGTAAAT 
AAGTTAGCTG ACACCTTAAA AGTAACTTTG GGACCTAAAG GTCGCAATGT TGTTCTTGAT
AAAAAGTTTG GCTCACCTCA AATAACAAAT GACGGTGTAA CTATTGCTAG AGATATTGAT
TTAGAAGATA ATTATGAAAA TATGGGAGCT CAGTTGGTTA AAGAGGTTGC AACTCAAACC
AATGACGTAG CTGGAGACGG TACAACTACT GCAACTATTC TAGCACAGGC TATGGTTAAT
GAAGGAATTA AGAACGTAAC TGCTGGTGCC AACCCCATGA TAATCAGAAA AGGAATCCAA
AAAGCTGTCG ATAGAGCAGT TGAAGAGTTG CAGAAAAACG CTGTCAGTGT TGAAGATAAA
GAGTCCATTT CTCAAGTTGC TTCAATCTCA GCTAATGACG AAGAAGTAGG TAAGCTAATT
GCTGAAGCTA TGGAAAAAGT AGGCAAAGAT GGAGTAATCA CAGTAGAAGA ATCCAAGAGC
TTCAAAACTG ATTTAAACGT AGTTGAAGGT ATGCAGTTTG ATAGAGGATA TGTTTCACCT
TATATGGTAA CTGATAATGA AAAGATGGAA GCTCACTTAG AAGAGCCATA CATCTTGATT
ACCGATAAGA AGATCGGTAA TATCCAAGAG ATTTTACCAG TACTAGAAAA AATCGTTGAA
CAAGGTAAAG AAGTTCTGTT AATCGCAGAA GACATCGAAG GCGAAGCCTT GGCTACATTA
GTTGTTAACA AGTTGAGAGG AACCTTCACT TGTGTGGGAG TTAAAGCTCC TGGTTTTGGT
GACAGAAGAA AAGCTATGCT TGAAGATATT GCCGTACTTA CAGGTGGTCA GGTAATCAGT
GAAGATGTAG GACTTGAACT TAAGAATGCT GATATCAGCA TGTTAGGTCG AGCTAGACAG
GTCACCATCA CTAAAGACGA TACAACTATC GTAGATGGCT ATGGAAATGA AGAAGATATC
CAAAAACGAA TTACTCAATT AAGAACTCAA ATTGAAGAAA CTACCTCAGA TTTCGATAGA
GAAAAACTAG AAGAGCGCTT AGCTAAACTA GCTGGAGGAG TTGCAGTAGT TGAAGTTGGA
GCAGCTACTG AAACTGAAAT GAAAGAAAAG AAACTACGAA TTGAAGACGC ACTTAACTCT
ACTAGAGCTG CTGTGGAAGA AGGAATCGTA GCTGGTGGTG GAACAGCCCT AATTGATGTA
TTACCTTCCC TTGAAGAAGT TCAAGCGGAT GGTGACGAGT CTACAGGAGT TAGCATCGTT
AGACGTGCTC TAGAAGAACC TGTACGTCAA TTGGCACATA ACTCTGGTGC TGAGGGCTCC
ATAGTTGCTG AACAAGTTAA GCAAAAAGGA ACAAACATAG GATTTAATGC CCTTGAAAAC
GATTACACTA ATATGCTCGA TGCTGGTGTA GTTGATCCTA AGAAAGTTAC TAGAAGCGCC
CTTGAAAATG CAGGAAGTAT CGCTGCAATG TTCTTAACAA CTGAAGCAGT AGTAGCAGAT
CTACCAGATG AGGACGATAA CGATGATGGA GACATGGGCG GTGGCGCCCC AGGAATGGGA
GGCATGGGCG GAATGCCCGG AATGTAA
 
Protein sequence
MAKDIKFRED ARARLEQGVN KLADTLKVTL GPKGRNVVLD KKFGSPQITN DGVTIARDID 
LEDNYENMGA QLVKEVATQT NDVAGDGTTT ATILAQAMVN EGIKNVTAGA NPMIIRKGIQ
KAVDRAVEEL QKNAVSVEDK ESISQVASIS ANDEEVGKLI AEAMEKVGKD GVITVEESKS
FKTDLNVVEG MQFDRGYVSP YMVTDNEKME AHLEEPYILI TDKKIGNIQE ILPVLEKIVE
QGKEVLLIAE DIEGEALATL VVNKLRGTFT CVGVKAPGFG DRRKAMLEDI AVLTGGQVIS
EDVGLELKNA DISMLGRARQ VTITKDDTTI VDGYGNEEDI QKRITQLRTQ IEETTSDFDR
EKLEERLAKL AGGVAVVEVG AATETEMKEK KLRIEDALNS TRAAVEEGIV AGGGTALIDV
LPSLEEVQAD GDESTGVSIV RRALEEPVRQ LAHNSGAEGS IVAEQVKQKG TNIGFNALEN
DYTNMLDAGV VDPKKVTRSA LENAGSIAAM FLTTEAVVAD LPDEDDNDDG DMGGGAPGMG
GMGGMPGM