Gene Tneu_1323 details

Gene Information

Locus tag: Tneu_1323
Symbol:
ID: 6165170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1184263
End bp: 1185909
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 57%
IMG OID: 641668479
Product: thermosome
Protein accession: YP_001794696
Protein GI: 171185777
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02339] thermosome, various subunits, archaeal
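
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names `gene_length` and `expected_protein_length` are hypothetical helpers written for this illustration and are not part of any database API.

```python
# Values copied from the Gene Information record above.
start_bp = 1184263
end_bp = 1185909
protein_length_aa = 548


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                           # 1647
print(expected_protein_length(length_bp))  # 548
print(expected_protein_length(length_bp) == protein_length_aa)  # True
```

Under these assumptions, the 1647 bp span corresponds to 549 codons: 548 amino acids plus one stop codon, which agrees with the reported protein length.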


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.678735
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGG CGGTGTTAAC CCAGATAGGC GGCGTGCCAG TGCTAGTGTT GAAGGAGGGC 
ACGCAGAGGG CTTTCGGCAA GGAGGCGCTC AGGCTTAACA TAATGATAGC CAGAGCAATT
TCCGAGGTAA TGAGGACGAC GCTGGGTCCA AAGGGTATGG ACAAAATGCT CATCGACTCC
CTCGGCGATA TAACGATCAC GAACGACGGC GCGACGATCT TAGACGAGAT GGACGTACAA
CACCCCATAG CGAAGCTGCT CGTAGAGATC TCGAAGTCTC AGGAAGAGGA GGCTGGAGAT
GGCACCACGT CGGCGGTTGT CCTCGCCGGG GCTCTCCTCG AGGAGGCTGA AAAGCTTCTC
GATAAGAACA TCCACCCGAC GGTGATAGTA AGCGGATTTA AGAAGGCGCT TGACGTGGCG
ACTGAGCACC TACGCAAGGT CGCCGTCCCC GTGAACAGAA ACGACGCCGA TACCCTGAAG
AAGATCGCGA TGACGTCCAT GGGAGGCAAA ATATCCGAAA CTGTGAAGGA GTACTTCGCC
GACTTGGCCG TGAAGGCGGT GCTCCAGGTC GCCGAGGCCA GAGATGGGAA ATACTACGTC
GACCTTGACA ACATCCAGAT AGTGAAGAAG CACGGCGCTT CGCTCCTCGA CACACAGCTG
GTATACGGCG TTATCGTGGA TAAGGAGGTC GTCCACGCCG CCATGCCTAA ACGCGTGGTA
AACGCCAAGA TAGCGCTACT TGACGCCCCG TTGGAGGTGG AGAAGCCTGA GATAGACGCC
GAGATTAGAA TAAGCGACCC GCTTCAGATG AAGGCCTTCC TTGAGGAGGA GGAGAAGATC
CTGAAGGGGT ATGTGGACAA GCTCAAGGCT TTAGGCGTAA CCGCTCTGTT TACCACCAAG
GGCATAGACG ACATCGCGCA GTACTACCTA GCCAAGGCGG GTATTCTCGC CGTGAGGAGG
GTCAAGCGTA GCGACATAGA GAAGCTCGTC AGAGCAACGG GAGGGAGACT TGTCACAAGC
ATTGAGGATC TCACCGAGGC CGACTTGGGC TTCGCCGGGC TTGTCGAGGA GAGGCGCGTC
GGCGATGAGA AGATGGTGTT TGTTGAGCAG TGCAAGAACC CGAGGGCCGT CTCCATACTT
GTGAGAGGCG GCTTCGAGAG GCTTGTGGAC GAGGCCGAGA GAAACCTCGA CGATGCGCTC
TCCGTCGTGG CCGACGTCGT GGAAGAGCCA TACATACTAC CCGCCGGAGG AGCCGCCGAG
ATCGAAGCTG CTAAGGCGGT GAGGGCCTTC GCCACAAAGG TGGGTGGGAG AGAGCAGTAC
GCGGTTGAGG CCTTCGCAAG AGCTCTTGAG GCTATACCGA AGGCCCTCGC CGAGAACGCA
GGTCTCGACC CCATCGACAT ATTGACTGAG CTCACACACA AGCACGAGCA GGCCGACGGC
TGGAAGTACG GCCTCGACGT GTACCAAGGC AAGGTGGTAG ACATGGCGGC GCTTGGCCTC
ATAGAGCCTC TGACAGTTAA GCTCAACGCT CTAAAGGTAG CGGTAGAGGC TGCGTCGATG
ATCCTCAGGA TCGACGAGAT TATAGCCGCA TCTAAGCTAG AGAAGGAGGA GAAGAAGGAG
GAGGGAAAGA AGGAGGAATC TGACTAG
 
Protein sequence
MSQAVLTQIG GVPVLVLKEG TQRAFGKEAL RLNIMIARAI SEVMRTTLGP KGMDKMLIDS 
LGDITITNDG ATILDEMDVQ HPIAKLLVEI SKSQEEEAGD GTTSAVVLAG ALLEEAEKLL
DKNIHPTVIV SGFKKALDVA TEHLRKVAVP VNRNDADTLK KIAMTSMGGK ISETVKEYFA
DLAVKAVLQV AEARDGKYYV DLDNIQIVKK HGASLLDTQL VYGVIVDKEV VHAAMPKRVV
NAKIALLDAP LEVEKPEIDA EIRISDPLQM KAFLEEEEKI LKGYVDKLKA LGVTALFTTK
GIDDIAQYYL AKAGILAVRR VKRSDIEKLV RATGGRLVTS IEDLTEADLG FAGLVEERRV
GDEKMVFVEQ CKNPRAVSIL VRGGFERLVD EAERNLDDAL SVVADVVEEP YILPAGGAAE
IEAAKAVRAF ATKVGGREQY AVEAFARALE AIPKALAENA GLDPIDILTE LTHKHEQADG
WKYGLDVYQG KVVDMAALGL IEPLTVKLNA LKVAVEAASM ILRIDEIIAA SKLEKEEKKE
EGKKEESD