Gene Apre_1500 details

Gene Information

Locus tag: Apre_1500
Symbol:
ID: 8398312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1634602
End bp: 1636224
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 41%
IMG OID: 644995864
Product: chaperonin GroEL
Protein accession: YP_003153242
Protein GI: 257066986
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
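
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated on this page. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1634602
END_BP = 1636224


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length(START_BP, END_BP)
print(length)                  # 1623, matching "Gene Length"
print(protein_length(length))  # 540, matching "Protein Length"
```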


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.233248
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAAG ATATTAAATT TTCATCTGAC GCAAGAAAGG GACTTGAAGC AGGAATTGAT 
AAATTAGCAA ATGCTGTAAA AGTAACTCTT GGACCAAAGG GACGTAACGT AGTTCTTGAT
AAGGCATATG GAGCACCAAC CATCACAAAC GACGGTGTAA CCATAGCCCA AGACATAGAA
CTTGAAGATA GATTTGAAAA TATGGGAGCT CAACTTGTAA AAGAAGTTGC TACAAAGACA
AATGACGTAG CAGGAGACGG AACTACAACA GCGACAGTTC TAGCTCACGC TATCATCAAA
GAAGGACTCA AAAACCTAGC AGCAGGAGCA AACCCAGTAG TACTTCAAAA GGGTCTAAAG
AAAGCAACTG ACGAAGTAGT TGACTATATC AAAGAAAACT CTAGAGAAGT AGAAGACAAA
CAAGCTATAG AAAACGTAGG AACAATCTCA TCAGCTGACC CAGAAATCGG TAAATTCATA
GCAGATGCTA TGGAAAAGGT TGGAAATGAT GGAGTAATCA CAGTAGAAGA ATCCAAAACA
ACAGATACCT ACCTAGACGT TGTAGAAGGA ATGCAATTTG ACAAGGGCTA TCTATCCCCA
TACATGGCAA CAGACAATGA AAAAATGATA GCTGACCTTG ACGATCCATA CATCCTTCTA
ACAGACAAGA AGATTTCAAA CATCCAAGAA ATCCTCCCAC TCCTAGAAGA AGTTGTTCAA
GCTTCCAAAC CACTTCTAAT CATAGCAGAT GACGTAGACG GTGAAGCTCT TACAACACTT
ATCCTAAACA AACTAAGAGG AACCTTCAAC GTAGTTGCAG TAAAGGCACC AGGCTATGGT
GATAGAAGAA AAGCTATGCT TGAAGATATT GCAATCTTAA CAGGAGCTAC AGTAGTAAGC
GAAGAGCTTG GTATGGACCT TAAAGATACA GCTATGGATA TGCTAGGATC TGCTAAGAAA
GTAAAAGTAG ACAAAGACAA CACAACCATA GTAGAAGGAA AAGGCGATAA GGCTAACCTT
GAAGAAAGAG TAGAAACAAT CCGCAAACAA ATCGAAACAG AAGATAGCGA ATACGAAAAA
GAAAAACTTC AAGAAAGAGT GGCCAAACTT GCTGGTGGAG TTGCAGTAAT CAACGTTGGA
GCTGCAACAG AAACTGAAAT GCAAGAGAAA AAATACAGAA TCGAAGACGC CCTATCAGCA
ACAAGAGCCG CAGTAGAAGA AGGTATAGTT GCAGGTGGAG GAGTTGTCCT AATCGGTGCA
ATCGAAAGAG TAGCTAAATT AGAAGAAAGC TTAAAGGCAG ATGAGAAGAC AGGTGCTCTA
ATCATCAAAA AAGCCCTAGA AGCTCCACTA AGACAAATCG TAGAAAACGC AGGCATGGAC
GGATCTGTAA TAGTAGAAAA GGTTAAAAAT TCTGCTAAGG ATGAAGGATA CGATGCCTAC
AACGACGAGT TCGTAAACAT GTTCGAAAAA GGAATCGTAG AACCAACCAA GGTAACAAGA
TCAGCCCTAC AAAACGCCGT TTCAGTTGCA GGAATGATCC TAACAACAGA AGCAGCAGTA
GCAGACATCC CAGAAGAAAA CCCAGCTCCA CAAATGCCAG CTGGCATGCC AGGAATGTAT
TAA
 
Protein sequence
MAKDIKFSSD ARKGLEAGID KLANAVKVTL GPKGRNVVLD KAYGAPTITN DGVTIAQDIE 
LEDRFENMGA QLVKEVATKT NDVAGDGTTT ATVLAHAIIK EGLKNLAAGA NPVVLQKGLK
KATDEVVDYI KENSREVEDK QAIENVGTIS SADPEIGKFI ADAMEKVGND GVITVEESKT
TDTYLDVVEG MQFDKGYLSP YMATDNEKMI ADLDDPYILL TDKKISNIQE ILPLLEEVVQ
ASKPLLIIAD DVDGEALTTL ILNKLRGTFN VVAVKAPGYG DRRKAMLEDI AILTGATVVS
EELGMDLKDT AMDMLGSAKK VKVDKDNTTI VEGKGDKANL EERVETIRKQ IETEDSEYEK
EKLQERVAKL AGGVAVINVG AATETEMQEK KYRIEDALSA TRAAVEEGIV AGGGVVLIGA
IERVAKLEES LKADEKTGAL IIKKALEAPL RQIVENAGMD GSVIVEKVKN SAKDEGYDAY
NDEFVNMFEK GIVEPTKVTR SALQNAVSVA GMILTTEAAV ADIPEENPAP QMPAGMPGMY
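
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the Python below strips that block formatting, translates the first row of the gene sequence, and computes a GC fraction. It is not the database's own method. The codon table uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and this gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as printed above.
first_row = "ATGGCTAAAG ATATTAAATT TTCATCTGAC GCAAGAAAGG GACTTGAAGC AGGAATTGAT"
dna = clean(first_row)
print(translate(dna))  # MAKDIKFSSDARKGLEAGID
print(f"GC fraction of this fragment: {gc_fraction(dna):.2f}")
```

The translated fragment matches the first two blocks of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives the value to compare against the "GC content" field.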