Gene Cmaq_0007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0007 
Symbol 
ID5710119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp19499 
End bp21175 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content49% 
IMG OID641274510 
Productthermosome 
Protein accessionYP_001539851 
Protein GI159040599 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACAG CTCAACAACC TAAGAGTGGT GTTCCAGTAG CCATACTTAA GGAGGGTAGT 
TCAAGGACAA CTGGTGCGGA TGCCAGGAGG AGTAACATAA TGGCTGCTAA GGTTATAACA
GAGGTTCTTC AAACAAGCCT TGGACCCAGG GGAATGGATA AGTTACTCAT TGACGCCTTC
GGTGACGTGA CAATAACCGG TGATGGTGCA ACAATACTCA AGGAAATGGA GGTCCAGCAC
CCTGCCGCTA AGCTACTTGT TGAGGTGGCT AAGGCACAGG ACGCTGAGGT TGGTGACGGT
ACAACAACCG TGGTTGTACT GGCGGGTAAG CTCCTTGAGC AGGCTGAGAT ACTGCTGGAT
GAGGGTATTC ACCCAACTAT AATAATTGAC GGATTTAAGA AGGCGTTAGA CTTCATAAAC
TCAACCATAA CCGAGGTACC GAACCTAATA TACCCTGTTA ACTTAAGTAA TAGGGATGAG
GTTGCTAAGA TTGTTGCCAA CTCACTGAGC AGTAAGGTTG TTGCTGAGGC TAGGGATTAC
TTAGCCAAGA TAGTTGTTGA CGCCTCCTAC ATTGCCGCTG AGCAAACCAA CGGTAAGTAC
AACCTGGATT TAGATTGGGT TAAGGTTGAG AAGAAGAAGG GCCAGAGCCT ATACGAGACT
CAATTCATCC AGGGTATTGT ACTGGATAAG GAGGTTGTTC ACCCAGGGAT GCCTAAGAGG
ATTGAGAAGG CTAAGATAGC TGTCTTAGAT GCGCCACTTG AGATTGAGAA GCCTGAGTGG
ACCACTAAGA TATCGGTTTC ATCACCGCAG CAGATTAAGG CCTACCTTGA GGAGGAGGCT
AACATACTTA AGGGTTACGT TGATAAGCTT AAGGAGATAG GGGCCAACGT AGTTATTACG
CAGAAGGGTA TTGATGAGAC TGCTCAACAC TTCCTGGCTA AGGCCGGTAT TATGGCTGTT
AGGAGGGTTA AGAGGAGTGA CATTGAGAAG CTGGCTAAGG CAACTGGGGC TAGGATAGCC
ACCAGTATTA AGGATCTTAA GCCCGAGGAC CTCGGTACAG CTGGGTTAGT TGAGGAGAGG
AAGGTTGGTG AGGAGAAGAT GGTGTTCGTT GAGCAATGCC CCAACCCAAG GGCAGTCACA
ATACTCATTA GGGGTGCCGC GGACAGGGTG CTTGATGAGG CTGAGAGGTC GATAAACGAT
GCACTACACG TCACAAGGGA CCTATTCAGG GATCCGAGAA TAGTCCCAGG CGGTGGGGCA
TTTGAGATTG AGGTTGCTAG GAGGCTTAGG GAGTGGGGTA GGAAGCTTCC AGGTAAGGAG
CAGTTAGCGG TAATGAGGTA CGCCGAGGCC GTGGAGAAGG TTCCTGAAAT ACTAGCCCTA
ACCGCGGGCC TAGACCCAGT GGATGCAATA GCTGAATTAA GGAGTAGGCA TGATAAGGGT
GAGCTTGACG CCGGTGTGGA TGTGCTTGGA GGCAGGATAA CAAGGATGAG TGAATTAAAC
ATATGGGACC CATTAATAGT TAAGATGCAG GTACTGAGAA GCGCCACTGA GGCAGCGATA
ATGGTGCTTA GGATAGATGA CATAATAGCA GCCGGCCAGA CAAAGTCATC CACAGGTAAG
GGTAAGGCCG GTGAGGAGTC AAAGACCGGT GAGGAAGGCG GAACCAGTAG TGATTAA
 
Protein sequence
MSTAQQPKSG VPVAILKEGS SRTTGADARR SNIMAAKVIT EVLQTSLGPR GMDKLLIDAF 
GDVTITGDGA TILKEMEVQH PAAKLLVEVA KAQDAEVGDG TTTVVVLAGK LLEQAEILLD
EGIHPTIIID GFKKALDFIN STITEVPNLI YPVNLSNRDE VAKIVANSLS SKVVAEARDY
LAKIVVDASY IAAEQTNGKY NLDLDWVKVE KKKGQSLYET QFIQGIVLDK EVVHPGMPKR
IEKAKIAVLD APLEIEKPEW TTKISVSSPQ QIKAYLEEEA NILKGYVDKL KEIGANVVIT
QKGIDETAQH FLAKAGIMAV RRVKRSDIEK LAKATGARIA TSIKDLKPED LGTAGLVEER
KVGEEKMVFV EQCPNPRAVT ILIRGAADRV LDEAERSIND ALHVTRDLFR DPRIVPGGGA
FEIEVARRLR EWGRKLPGKE QLAVMRYAEA VEKVPEILAL TAGLDPVDAI AELRSRHDKG
ELDAGVDVLG GRITRMSELN IWDPLIVKMQ VLRSATEAAI MVLRIDDIIA AGQTKSSTGK
GKAGEESKTG EEGGTSSD