Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_1004 |
Symbol | |
ID | 8390313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 1027381 |
End bp | 1028691 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644979019 |
Product | carbonic anhydrase |
Protein accession | YP_003136772 |
Protein GI | 257058884 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3338] Carbonic anhydrase |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACTT CTTTATCTAG GATATATAAC CCCCTAGGGC TGGTTAGTGC TGCCGCTCTC ACGTTCTGCT CTTTAGTCGC TGGATTAGCC ACTCCGGCTT TAGCGCAATT AGGGTCAATT TCGGGCACTA AATTCAACGA TCTCAATCAA AATGGAATTA GAGAGCCCTT AGAACTGGGT TTACCGGGAT GGCAAATTCA ATTAATTAAC TTTGATGGTG ATGTCATTGC AACCACCACC ACGAACCTGT TTGGTAACTA CGAGTTTACC GGGTTAGCAC CGGGTCCCTA TGTGGTGCGG GAAGTCATGC AACCTGGTTG GAAACAGACC CTACCAAACT TTATTGAAAG TATGCAATTG GGTCAAGTCA ATGGAGGTTG GGACTATGAT GATCCTGACA ATGATTGGCC GCTAATTGCA CCCGATGCTA ACGGCAACTT TCAATCCCCA ATCAATATCA CGGAAACACC TCCCATTGAT TTAAGCGAAT ATATCACCAT TAACTATTCT GGGCAAAACC TAGATGAAGT TAAAAACTCC GGCTATAACT TTGATGTGGA GTACTTTCCG AGTAATTTCA ATACCGTCGA TGTAGCCGGG GAAACTTTTG AACTGTTGCA GTTCCATTTC CACTACGAAA GTGAACACGC CATTGATGGA CTACTGTCGG ATATGGAGTT ACACTTCGTT AACCGTCATG AGGATGGAGG ACTGTCTGTT CTTGGGTTAT TAGTCGAAGA AGGTAGTGAA AATTTGCCAT TAAAACCGCT TTTTGATGCC ATTGACGCTC AACTAGATGC CAATGGAAGC TTGCCATCGA CTTTCACCTT ACCTCAAAAC TTAAATATTG CCAGTATCTT CCCCAATAAT TTTGATGGTT GGTTCTACAA TGGCTCCTTA ACCACACCCC CGGCAACCGA AGGTGTTAAC TGGTTTGTTT TTGAAACCCC CATTCAACTG TCTACCGCAC AAATCGACAT TTTTCAAAAT TTTCTTAGCA GCATTGGTTT TACTCACAAC AATCGACCAT TGCAAGATTT GAATGGAAGA CAGTTAAATG AACATACGCA TCAAGAAACT CTGAATGGAG GTTCAATTTC TCAGCTTAAC TTTGGGAATG CTTTGGATTT AGGTCTTTTC AGGTTCGCTC AGCTTAATTA TCAAGTTACC GTCAATGAAA ATGATGTTGT CGATCTGAAT TTTGGTAGTA CTAACACGAC CATTCCTGAA CCCTCTTCTG TAATTGCCTT ATTCGGTTTG TCTGGCGTTG GTTTACTTTC TCGGTTGAGA CAGAAGAAAG TTGAACGCTA A
|
Protein sequence | MNTSLSRIYN PLGLVSAAAL TFCSLVAGLA TPALAQLGSI SGTKFNDLNQ NGIREPLELG LPGWQIQLIN FDGDVIATTT TNLFGNYEFT GLAPGPYVVR EVMQPGWKQT LPNFIESMQL GQVNGGWDYD DPDNDWPLIA PDANGNFQSP INITETPPID LSEYITINYS GQNLDEVKNS GYNFDVEYFP SNFNTVDVAG ETFELLQFHF HYESEHAIDG LLSDMELHFV NRHEDGGLSV LGLLVEEGSE NLPLKPLFDA IDAQLDANGS LPSTFTLPQN LNIASIFPNN FDGWFYNGSL TTPPATEGVN WFVFETPIQL STAQIDIFQN FLSSIGFTHN NRPLQDLNGR QLNEHTHQET LNGGSISQLN FGNALDLGLF RFAQLNYQVT VNENDVVDLN FGSTNTTIPE PSSVIALFGL SGVGLLSRLR QKKVER
|
| |