Gene Tneu_1525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1525 
Symbol 
ID6166140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1356716 
End bp1358377 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content57% 
IMG OID641668683 
Productthermosome 
Protein accessionYP_001794895 
Protein GI171185976 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.360658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGG CGCAAGCCCC TAGGACAGGC GTGCCCGTCA TGATATTGAA AGAGGGGAGC 
CAGAGGACAA CCGGCGTAGA TGCACGCCGT TCTAACATCC AAGCGGCTAA GGTAATTGCG
GAGATACTCG CCACATCTCT AGGGCCTAGA GGGATGGACA AGATGCTCAT CGACGCATTT
GGTGACGTCA CAATAACCGG AGACGGCGCC ACAATCCTCA AGGAGATGGA GGTCCAGCAC
CCAGCCGCCA AACTCCTAAT CGAGGTGGCG AAGGCGCAAG ATGCAGAGGT GGGTGACGGC
ACAACCACTG TGGTTGTTCT AGCCGGCAAA CTGCTGGAGC TAGGCGAGGA GCTGTTGGAG
GAGGGCATCC ATCCAACCAT CGTGATCGAC GGTTACAAGA AAGCCGCAGA TTACGCCCTT
AAGGTAGCCG AGGAGTTCGC GAAGCCGATC GACCTCACAA AGGAGCAACT GCTTAAAGTC
GTCTCGAGCT CCCTCTCCTC CAAGGTGGTG GCGGAGACCA GAGACTACCT GGCGGGCCTC
GTGGTGGAGG CGGCTCTGCA GGCCGTTGAG ACAAGAGACG GGAAGCCCTA TCTCGACCTA
GACTGGATCA AAATCGAGAA GAAGAAGGGC AAATCCATCT ACGAGACCCA GCTAGTCAGA
GGCATCGTGC TAGACAAGGA GGTAGTTCAC CCAGGTATGC CTAAGCGTGT GACAAACGCC
AAAATCGCCA TCTTGGACGC CCCCCTGGAG ATCGAGAAGC CCGAGTGGAC CACGAAGATC
TCCGTCACAA GCCCTGACCA GATAAAGGCG TTTCTAGACC AGGAGGCCGA GATCCTGAAG
TCATACGTAG ACCACCTGGC CTCCATAGGG GCCAACGTTG TAATTACGCA GAAGGGAATC
GACGAGGTTG CTCAGCACTT CCTAGCGAAG AAGGGCATAA TGGCAATTAG AAGGGTCAAG
AGAAGCGACA TAGAGAAGCT CGCTAGAGCG ACTGGCGCCA AGATCATTAC CTCCATCAAG
GACGCCAAGC CTGAGGACCT AGGCACCGCC GGGCTTGTGG AGGAGAGGAA GGTGGGCGAG
GAGAAGATGG TGTTTGTGGA GAACATACCC AATCCGAGGG CCGTAACAAT ACTGGTCCGC
GGCGGTAGCG ACAGGATCCT TGACGAGGTC GAGAGGTCGC TCCAGGACGC CCTCCACGTC
GCACGCGACC TCTTCAGAGA GCCTAAGATA GTGCCCGGCG GCGGCGCCTT TGAAATAGAG
GTTAGTAGAA AGGTGAGGGA GTACGCCAGA AAGCTTCCAG GGAAGGAGCA GCTGGCCGCC
CTAAAATTCG CCGACGCTCT GGAGCACATA CCCACGATCT TGGCGTTGAC CGCGGGCCTT
GACCCCGTAG ACGCCATCGC CGAGCTGAGG CGTAGACATG ACAACGGCGA GTTCTCCGCT
GGTGTAGACG TACATGGGGG CAAGATAGCA GATATGGCGT CGCTCAACGT GTGGGACCCG
TTGATAGTTA AGAAGCAGGT GATAAAGTCG GCGGTGGAGG CCGCCATCAT GATACTCCGC
ATAGACGACA TAATTGCCGC CGGGGCTCCG AAGAAGGAGG AGAAGAAGGG CAAGAAGGGC
GGGGAGGAAG GCGAGGAGAA GAGCGAGACC AAATTCGACT AG
 
Protein sequence
MAQAQAPRTG VPVMILKEGS QRTTGVDARR SNIQAAKVIA EILATSLGPR GMDKMLIDAF 
GDVTITGDGA TILKEMEVQH PAAKLLIEVA KAQDAEVGDG TTTVVVLAGK LLELGEELLE
EGIHPTIVID GYKKAADYAL KVAEEFAKPI DLTKEQLLKV VSSSLSSKVV AETRDYLAGL
VVEAALQAVE TRDGKPYLDL DWIKIEKKKG KSIYETQLVR GIVLDKEVVH PGMPKRVTNA
KIAILDAPLE IEKPEWTTKI SVTSPDQIKA FLDQEAEILK SYVDHLASIG ANVVITQKGI
DEVAQHFLAK KGIMAIRRVK RSDIEKLARA TGAKIITSIK DAKPEDLGTA GLVEERKVGE
EKMVFVENIP NPRAVTILVR GGSDRILDEV ERSLQDALHV ARDLFREPKI VPGGGAFEIE
VSRKVREYAR KLPGKEQLAA LKFADALEHI PTILALTAGL DPVDAIAELR RRHDNGEFSA
GVDVHGGKIA DMASLNVWDP LIVKKQVIKS AVEAAIMILR IDDIIAAGAP KKEEKKGKKG
GEEGEEKSET KFD