Gene PCC8801_1599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1599 
Symbol 
ID7102352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1673139 
End bp1675139 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content49% 
IMG OID643474671 
ProductCarbonate dehydratase 
Protein accessionYP_002371807 
Protein GI218246436 
COG category[R] General function prediction only 
COG ID[COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGTCC GCACAGCCGC GGCACCCCCG ACTCCTTGGT CGAAAACCCT AGCCGAACCC 
CAAATTGATG AGAGTGCCTA CGTCCATTCT TTCTCGAACG TCATTGGTGA TGTCAAAGTG
GGTGCTAATG TTCTGATTGC CCCAGGAACC TCGATTCGTG CCGATGAAGG AACCCCGTTT
TCCATTGGAG AATCCACCAA TATTCAAGAT GGAGTGGTCA TTCACGGTCT TGAACAAGGT
CGGGTCGTAG GAGATGATGG TCAAGAATAT TCGGTTTGGA TCGGAAAACA GGCCTGTATT
ACCCACATGG CACTGATTCA CGGTCCGGCT TATGTTGGGG ATGGCTGCTT TATTGGCTTT
CGCTCAACGG TATTTAACGC GAGAATTGGG CAAGGCTGTA TTGTCATGAT GCACGCCTTG
ATTCAAGATG TAGAGATTCC CCCAGGGAAA TACGTTCCCT CGGGGGCTGT CATTACAAAC
CAACAACAGG CGGATCGTCT GCCTGATGTC ACGGAGGGCG ATCGCGCTTT CGCCCACCAT
GTCGTAAAAA TCAACGAATC TCTGAGGGTG GGTTATCAGT GCGCTGAAAA CAACGCCTGT
ATTATGCCCA TTCGGGAGCA GTTGGAAAAG TCTATAAATG GCGTTAATGA GACTGATTAT
AGAAATTTGG TGACCAATAT GAGTTTAAGT CCAGAAATTG TAACCCAAGT ACGCTCGTTA
ATTTCCCAAG GCTATAGCAT CGGGGCAGAA CACGCCGATA AGCGTCGTTT TCGCGCCAAG
TCTTGGACAA CCTACGGAAC CTTCAAAGGA CGCGCTGATC AAGTATTAGC CTCCTTAGAA
GCCTGTTTAC AAGACTGTCA AGGGGAATAT GTCCGTCTAA TTGGGATTGA TACCCAAGCC
AAACGGCGCG TTCTCGAAGA AATCGTCCAA CGGCCTGACG ATACCCCCGG AACCCCCTCT
CGAATCACCA CCACCAAGAG CTATGGCAGC AACGGCCATA GCTCAAATAG TAGCAATGGC
AATGGCCATG GTGGACTAGC CTCCGATGTG GTGTCCCAAG TTCGGGCTCT GATCCATCAA
GGCTACAAAG TGGGAACCGA AGTGGCTAAC CAACGCCGTT TTAAAACGGG TTCTTGGTTA
ACTGGCCCCG CTATTAGTAG TCAACGAGAA GCCGACGTAA TACGGGCTTT AGACGGGATT
ATTGCTGAGC ATGGTGGGGA GTATGTTCGC CTGATTGGAA TCGACCCCAA CGCTAAAAAA
CGGGTAGCTG AGGTGATTAT TCACCGTCCA GGGGAAGGCT CGTCAGCCTC CTCTAATGGA
GCAGCCCCTT CTGCTAGTTA TGGTAATCGC TCTAGTGGCA GCAATGGCAG TTCTAGTGCG
GGATTAAGTG CTGAAACCCT TAATCAAGTA CGGGGTTTAT TATCTCAAGG CTACAAAATC
GGTACAGAAC ACGCTGATAA GCGTCGTTTT CGGACTAAAT CTTGGCAAAG CTGCGCTCCC
ATTGATAGTA ACCGCGAATC AGAAGTGATT GCCGCTTTAG AAGCTTGTTT AGCCGAACAC
CACGGGGAAT ACGTTCAGTT GATTGGGATT GATACCCAAG CCAAACGCCG TGTCTTAGAA
GCGATTATTC AACGTCCTGG AGAAGCCTCA AGCAATGGTG CTAGTCGTGC CTCTGCAACA
GCAGCTACCC CAAGCTATTC TAATGGTGCG AGTCAAGCGA GTAACAATAT TAGCCGCACG
AATCTCGATT CTGATGCGAT TAACCAAGTG CGATCGCTAC TTTCCCAAGG CTACAAAATT
GGCACGGAAC ACGCTGATAA ACGCCGTTTT CGGACTAAAT CTTGGCAAAG TTGTCAACCC
ATTGAGAGTA CCCGCGAATC AGAAGTGATT GCCGCGCTAG AAGCCTGTTT AGCCGAACAT
CAAGGGGAAT ATGTGCGCTT ACTCGGTATT GATACCGTAG CCAAACGCCG TGTTCTAGAA
ACCCTGATTC AACGTCCCTA A
 
Protein sequence
MVVRTAAAPP TPWSKTLAEP QIDESAYVHS FSNVIGDVKV GANVLIAPGT SIRADEGTPF 
SIGESTNIQD GVVIHGLEQG RVVGDDGQEY SVWIGKQACI THMALIHGPA YVGDGCFIGF
RSTVFNARIG QGCIVMMHAL IQDVEIPPGK YVPSGAVITN QQQADRLPDV TEGDRAFAHH
VVKINESLRV GYQCAENNAC IMPIREQLEK SINGVNETDY RNLVTNMSLS PEIVTQVRSL
ISQGYSIGAE HADKRRFRAK SWTTYGTFKG RADQVLASLE ACLQDCQGEY VRLIGIDTQA
KRRVLEEIVQ RPDDTPGTPS RITTTKSYGS NGHSSNSSNG NGHGGLASDV VSQVRALIHQ
GYKVGTEVAN QRRFKTGSWL TGPAISSQRE ADVIRALDGI IAEHGGEYVR LIGIDPNAKK
RVAEVIIHRP GEGSSASSNG AAPSASYGNR SSGSNGSSSA GLSAETLNQV RGLLSQGYKI
GTEHADKRRF RTKSWQSCAP IDSNRESEVI AALEACLAEH HGEYVQLIGI DTQAKRRVLE
AIIQRPGEAS SNGASRASAT AATPSYSNGA SQASNNISRT NLDSDAINQV RSLLSQGYKI
GTEHADKRRF RTKSWQSCQP IESTRESEVI AALEACLAEH QGEYVRLLGI DTVAKRRVLE
TLIQRP