Gene Apar_0184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0184 
Symbol 
ID8413032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp215060 
End bp216697 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content53% 
IMG OID645021756 
Productchaperonin GroEL 
Protein accessionYP_003179211 
Protein GI257783994 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAG ATATCGATTT TGGCACCGAC GCTCGTGCCA AGCTTGCTAA GGGCGTAAAC 
ACCCTGGCAG ATGCCGTAAC CACCACCCTT GGACCTAAGG GTCGCTACGT TGCACTGCAG
CGTTCTTATG GTGCACCAAC TATTACCAAC GACGGCGTTT CCGTTGCTCG CGAGATTGAG
CTTAAGGACC CAGTAGAGAA CATGGGTGCA CAGCTGGTCA AAGAGGTTGC AACCAAGACT
AACGATACTG TCGGTGATGG TACCACCACC GCAACCTTGC TTGCACAGGT TATTGTTAAC
GAGGGTCTTC GTAATGTTGC AGCTGGTGCA AATCCAATTG CAATTCGCCG CGGCATCGAC
AAGGCTGTTG AGGCTGTTGT CTCCCAGATG AAGGGTATTG CTAAGGAGGT CTCAACCAAG
CAGCAGATTG CTTCCGTTGG TACCATTTCT GCAGGTGACC CTGTCATTGG TGGCAAGATT
TCAGATGCTA TGGACGTTGT CGGCAAGGAT GGCGTCATTA CTGTTGAGGA GTCTCAGACC
TTTGGTATTG ACATCGACAC CGTTGAGGGT ATGCAGTTTG ACAAGGGCTA TGTTTCCCCT
TACTTCGCAA CCAACAACGA GACTCTCACC GCTGAGCTGG ACAATCCTTA CATTCTGATG
ACCGATAGCA AGATCTCCTC CATCCAGGAC ATCCTGCCTA TCCTTGAGGC TGTCCAGAAG
CAGGGCGCAC CTCTGCTCAT CATGGCTGAG GACGTTGACG GCGAGGCTTT GACCACCCTC
ATCCTTAACA AGCTTCGTGG CGTTCTGAAC GTCTGCGCAA TTAAGGCTCC TGCATACGGC
GACCGTCGTA AGCGTATGCT TGAGGACATT GCTGTTCTCA CTGGTGGTCA GGCAGTCATC
AAGGAGCTCG GTGTTAACCT CAATGAGATT ACCGCTGACA TGCTCGGCCG CGCTAAGTCC
GTCAAGGTTA CCAAGGAGAC CACTACCATC GTTGGTGGCG CTGGTTCCAA GGACGCAATT
GACGAGCGTA TTGCTCAGAT TAAGGCTGAG ATTGACAACA CCACTTCTGA CTTTGATCGC
GAGAAGCTCC AGGAGCGTCT TGCTAAGCTT GCTGGCGGCG TTGCCGTTAT CAAGGTTGGT
GCAGCTACTG AGGTTGAGCT CAAGGAGATT AAGCACCGCA TCGAGGACGC ACTTCAGGCA
ACTCGCGCAG CTGTCGAGGA GGGTATTGTT GCAGGCGGCG GCGTTTCCTT CCTGGCTGCA
TCTTCTGTTT TGGATAGCGT TCAGACCTCT GATGCAGACG AGAAGATTGG TGTTGAAATC
ATCCGTAAGG CACTTGAGGC TCCAGTTCGC ACCATCGCTA ACAATGCTGG TTTCGAGGGC
AGCGTAGTTG TCGAGAAGAT CAAGGCACTC CCAGCAGGTC AGGGTCTTGA TTCTGCAACA
GGTCAGTATG GCGACATGAT TGAGATGGGC GTTCTTGACC CAGTTAAGGT CACCCGCACC
ACGCTTCAGA ATGCAGCTTC CGTTGCATCC CTCATCCTCA TCACTGAGGC AACTGTTTCC
GAGATGCCAA AGGACACCAC CATCGAGGAG TCCATTTCTC GTGCAGCTGC TCAGGGCGGC
CAGGGCGGCA TGTACTAA
 
Protein sequence
MAKDIDFGTD ARAKLAKGVN TLADAVTTTL GPKGRYVALQ RSYGAPTITN DGVSVAREIE 
LKDPVENMGA QLVKEVATKT NDTVGDGTTT ATLLAQVIVN EGLRNVAAGA NPIAIRRGID
KAVEAVVSQM KGIAKEVSTK QQIASVGTIS AGDPVIGGKI SDAMDVVGKD GVITVEESQT
FGIDIDTVEG MQFDKGYVSP YFATNNETLT AELDNPYILM TDSKISSIQD ILPILEAVQK
QGAPLLIMAE DVDGEALTTL ILNKLRGVLN VCAIKAPAYG DRRKRMLEDI AVLTGGQAVI
KELGVNLNEI TADMLGRAKS VKVTKETTTI VGGAGSKDAI DERIAQIKAE IDNTTSDFDR
EKLQERLAKL AGGVAVIKVG AATEVELKEI KHRIEDALQA TRAAVEEGIV AGGGVSFLAA
SSVLDSVQTS DADEKIGVEI IRKALEAPVR TIANNAGFEG SVVVEKIKAL PAGQGLDSAT
GQYGDMIEMG VLDPVKVTRT TLQNAASVAS LILITEATVS EMPKDTTIEE SISRAAAQGG
QGGMY