Gene Ava_C0144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0144 
SymbolgroEL 
ID3678080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp173618 
End bp175255 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content51% 
IMG OID637715227 
Productchaperonin GroEL 
Protein accessionYP_320421 
Protein GI75812804 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAGA CAATTCTGTA CAAGGACACG GCCCGTTGGA CATTGGAAAA AGGCTTTGAC 
GCTCTGACCG AAGCCGTCGC TGTCACCCTC GGTCCCAAAG GGCGGAATAT AGTTTTGGAG
AAAAAATTTG GCGCTCCTCA AATCGTCAAT GATGGAGTAA CTATTGCCAA AGAAATTGAA
CTAGAAGATC CCGCTGAGAA TACCGGGATT TCCTTGCTAC GTCAAGCAGC CTCCAAGACA
AACGATGTCG CGGGTGACGG AACCACCACT GCAATTGTGC TGGCTCACGC GATGCTTAAA
GAAGGGCTAC GCAACGTTAC GGCTGGTGCC GATCCCCTGA CGGTGAAACG GGGGATTGAT
AAAGCTACAG AATTCGTCAT CGAAAAAATT CAAGAACACG CCCGTCCAAT TAAAGATTCT
AGGGACATTG AGCAGGTTGC AACCATTTCA GCGGGTAATG ACCCCGAAGT GGGTCGGATT
GTGGCTGAGG CGATGGAAAG AGTGGGCAAA GAAGGTGTAA TCTCTCTAGA AGAAGGCAGA
TCCACAACGA CCGAACTGGA AGTCACTGAA GGAATGCGCT TTGATAGAGG GTACGTTTCG
CCTTACTTTG TGACTGATCC AGAGCGAATG GAGGCAGTGC TGGAAGAACC CCACATTTTA
ATCACCGATC GCAAGATTAC AATGGTTCAA GATTTGGTGC CGATTCTGGA GCAAATTGCT
CGCACGGGCA AACCCCTACT AATCATTGCG GATGACATTG AAAAAGAAGC CTTGGCAACT
TTAGTTGTCA ACCATTTGCG CGGTGTACTG CGAGTCGCGG CGGTGAAGGC TCCCGGCTTT
GGCGCTCAAC GTAAAGCCGT GCTGGAAGAC ATCGCGGCGC TCACAGGTGG TCAGGTGATT
AGCGAAGATA CGGGCTTGAA GCTAGAAAAT GTCAGGTTAG AAATGCTTGG TAAAGCCCGG
CGAGCGATCG TTACCAAAGA CGACACAACC ATCGTGGCAG AAGGCAACGA AGAAGCTGTC
AAAGCTCGCA TCGAACAAAT TCGTCGCCAA ATCCAAGAAG TTGAATCCTC CTACGATAAG
GAAAAACTGC AACAACGGCT GGCGAAACTG GCAGGTGGTG TCGCGGTGAT TAAGGTCGGT
GCTGCCACCG AAACTGAACT CAAAGACCGC AAGCTACGTC TAGAAGATGC AATTAATGCC
ACCAAAGCCG CAGTTGAAGA AGGAATTGTG CCCGGTGGCG GTACAACAAT AGCACACATT
GCTCCCCAGC TTGAGGAATG GGCAAAGAGC CAAATGAAGG ATGAAGAGTT GACGGGCACT
ATGATTGTGG CTCGCGCCCT TTATGCTCCA CTACGTCGCA TTGCTGATAA TGCTGGTGCT
AATGGTGCGG TGATTGTGGA GCGCGTCCGT GAACTGCCCT TTGATGAGGG ATACGATGCC
GTTGCTAATA AATTTGTGAA TATGTTCGAG GCGGGCATTG TCGATCCAGC GAAGGTGACT
CGTAGTGCTT TGCAGAATGC TGCATCTATT GGTGGTATGG TGCTGACCAC TGAGGGTGTT
GTGGTAGAAA AACCTGACAA GCAGGCGAAA GCGCCAGCTG GTGTCGGCCC TGGCCCTGGC
GAAGGCTTCG ATTATTGA
 
Protein sequence
MVKTILYKDT ARWTLEKGFD ALTEAVAVTL GPKGRNIVLE KKFGAPQIVN DGVTIAKEIE 
LEDPAENTGI SLLRQAASKT NDVAGDGTTT AIVLAHAMLK EGLRNVTAGA DPLTVKRGID
KATEFVIEKI QEHARPIKDS RDIEQVATIS AGNDPEVGRI VAEAMERVGK EGVISLEEGR
STTTELEVTE GMRFDRGYVS PYFVTDPERM EAVLEEPHIL ITDRKITMVQ DLVPILEQIA
RTGKPLLIIA DDIEKEALAT LVVNHLRGVL RVAAVKAPGF GAQRKAVLED IAALTGGQVI
SEDTGLKLEN VRLEMLGKAR RAIVTKDDTT IVAEGNEEAV KARIEQIRRQ IQEVESSYDK
EKLQQRLAKL AGGVAVIKVG AATETELKDR KLRLEDAINA TKAAVEEGIV PGGGTTIAHI
APQLEEWAKS QMKDEELTGT MIVARALYAP LRRIADNAGA NGAVIVERVR ELPFDEGYDA
VANKFVNMFE AGIVDPAKVT RSALQNAASI GGMVLTTEGV VVEKPDKQAK APAGVGPGPG
EGFDY