Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_1624 |
Symbol | |
ID | 8390936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 1661952 |
End bp | 1663952 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644979617 |
Product | Carbonate dehydratase |
Protein accession | YP_003137366 |
Protein GI | 257059478 |
COG category | [R] General function prediction only |
COG ID | [COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAGTCC GCACAGCCGC GGCACCCCCG ACTCCTTGGT CGAAAACCCT AGCCGAACCC CAAATTGATG AGAGTGCCTA CGTCCATTCT TTCTCGAACG TCATTGGTGA TGTCAAAGTG GGTGCTAATG TTCTGATTGC CCCAGGAACC TCGATTCGTG CCGATGAAGG AACCCCGTTT TCCATTGGAG AATCCACCAA TATTCAAGAT GGAGTGGTCA TTCACGGTCT TGAACAAGGT CGGGTCGTAG GAGATGATGG TCAAGAATAT TCGGTTTGGA TCGGAAAACA GGCCTGTATT ACCCACATGG CACTGATTCA CGGTCCGGCT TATGTTGGGG ATGGCTGCTT TATTGGCTTT CGCTCAACGG TATTCAATGC CAGAATTGGG CAAGGCTGTA TTGTCATGAT GCACGCCTTG ATTCAAGATG TAGAGATTCC CCCAGGGAAA TACGTTCCCT CGGGGGCTGT CATTACAAAC CAACAACAGG CGGATCGTCT GCCTGATGTC ACGGAGGGCG ATCGCGCTTT CGCCCACCAT GTCGTAAAAA TCAACGAATC TCTGAGGGTG GGTTATCAGT GCGCTGAAAA CAACGCCTGT ATTATGCCCA TTCGGGAGCA GTTGGAAAAG TCTATAAATG GCGTTAATGA GACTGATTAT AGAAATTTGG TGACCAATAT GAGTTTAAGT CCAGAAATTG TAACCCAAGT ACGCTCGTTA ATTTCCCAAG GCTATAGCAT TGGGGCAGAA CACGCCGATA AGCGTCGTTT TCGCGCCAAG TCTTGGACAA CCTACGGAAC CTTCAAAGGA CGCGCTGATC AAGTATTAGC CTCCTTAGAA GCCTGTTTAC AAGACTGTCA AGGGGAATAT GTCCGTCTAA TTGGGATTGA TACCCAAGCC AAACGGCGCG TTCTCGAAGA AATCGTCCAA CGGCCTGACG ATACCCCCGG AACCCCCTCT CGAATCACCA CCACCAAGAG CTATGGCAGC AACGGCCATA GCTCAAATAG TAGCAATGGC AATGGCCATG GTGGCCTAGC CTCCGATGTG GTGTCCCAAG TTCGAGCTCT GATCCATCAA GGCTACAAAG TGGGAACCGA AGTGGCTAAC CAACGCCGTT TTAAAACGGG TTCTTGGTTA ACTGGCCCCG CTATTAGTAG TCAACGGGAA GCCGATGTAA TACGGGCTTT AGACGGGATT ATTGCTGAGC ATGGTGGGGA GTATGTTCGC CTGATTGGAA TCGACCCCAA CGCTAAAAAA CGGGTAGCTG AGGTGATTAT TCACCGTCCA GGGGAAGGCT CGTCAGCCTC CTCTAATGGA GCAGCCCCTT CTGCTAGTTA TGGTAATCGC TCTAGTGGCA GCAATGGCAG TTCTAGTGCG GGATTAAGTG CTGAAACCCT TAATCAAGTA CGGGGTTTAT TATCTCAAGG CTACAAAATC GGTACAGAAC ACGCTGATAA GCGTCGTTTT CGGACTAAAT CTTGGCAAAG CTGCGCTCCC ATTGATAGTA ACCGCGAATC AGAAGTGATT GCCGCTTTAG AAGCTTGTTT AGCCGAACAC CACGGGGAAT ACGTTCAGTT GATTGGGATT GATACCCAAG CCAAACGCCG TGTCTTAGAA GCGATTATTC AACGTCCTGG AGAAGCCTCA AGCAATGGTG CTAGTCGTGC CTCTGCAACG GCAGCTACCC CAAGCTATTC TAATGGTGCG AGTCAAGCGA GTAACAATAT TAGCCGGACG AATCTCGATT CTGATGCGAT TAACCAAGTG CGATCGCTGC TTTCCCAAGG CTACAAAATT GGTACGGAAC ACGCTGATAA ACGCCGTTTT CGGACTAAAT CTTGGCAAAG TTGTCAACCC ATTGAGAGTA CCCGCGAATC AGAAGTGATT GCCGCGCTAG AAGCCTGTTT AGCCGAACAT CAAGGGGAAT ATGTGCGCTT ACTCGGTATT GATACCGTAG CCAAACGCCG TGTTCTAGAA ACCCTGATTC AACGTCCCTA A
|
Protein sequence | MVVRTAAAPP TPWSKTLAEP QIDESAYVHS FSNVIGDVKV GANVLIAPGT SIRADEGTPF SIGESTNIQD GVVIHGLEQG RVVGDDGQEY SVWIGKQACI THMALIHGPA YVGDGCFIGF RSTVFNARIG QGCIVMMHAL IQDVEIPPGK YVPSGAVITN QQQADRLPDV TEGDRAFAHH VVKINESLRV GYQCAENNAC IMPIREQLEK SINGVNETDY RNLVTNMSLS PEIVTQVRSL ISQGYSIGAE HADKRRFRAK SWTTYGTFKG RADQVLASLE ACLQDCQGEY VRLIGIDTQA KRRVLEEIVQ RPDDTPGTPS RITTTKSYGS NGHSSNSSNG NGHGGLASDV VSQVRALIHQ GYKVGTEVAN QRRFKTGSWL TGPAISSQRE ADVIRALDGI IAEHGGEYVR LIGIDPNAKK RVAEVIIHRP GEGSSASSNG AAPSASYGNR SSGSNGSSSA GLSAETLNQV RGLLSQGYKI GTEHADKRRF RTKSWQSCAP IDSNRESEVI AALEACLAEH HGEYVQLIGI DTQAKRRVLE AIIQRPGEAS SNGASRASAT AATPSYSNGA SQASNNISRT NLDSDAINQV RSLLSQGYKI GTEHADKRRF RTKSWQSCQP IESTRESEVI AALEACLAEH QGEYVRLLGI DTVAKRRVLE TLIQRP
|
| |