Gene Cyan8802_1004 details

Gene Information

Locus tag: Cyan8802_1004
Symbol:
ID: 8390313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 1027381
End bp: 1028691
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 42%
IMG OID: 644979019
Product: carbonic anhydrase
Protein accession: YP_003136772
Protein GI: 257058884
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3338] Carbonic anhydrase
TIGRFAM ID: [TIGR02595] PEP-CTERM putative exosortase interaction domain
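
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1027381
end_bp = 1028691
reported_gene_length = 1311     # bp
reported_protein_length = 436   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)           # 1311 True
print(protein_length, protein_length == reported_protein_length)  # 436 True
```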


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACTT CTTTATCTAG GATATATAAC CCCCTAGGGC TGGTTAGTGC TGCCGCTCTC 
ACGTTCTGCT CTTTAGTCGC TGGATTAGCC ACTCCGGCTT TAGCGCAATT AGGGTCAATT
TCGGGCACTA AATTCAACGA TCTCAATCAA AATGGAATTA GAGAGCCCTT AGAACTGGGT
TTACCGGGAT GGCAAATTCA ATTAATTAAC TTTGATGGTG ATGTCATTGC AACCACCACC
ACGAACCTGT TTGGTAACTA CGAGTTTACC GGGTTAGCAC CGGGTCCCTA TGTGGTGCGG
GAAGTCATGC AACCTGGTTG GAAACAGACC CTACCAAACT TTATTGAAAG TATGCAATTG
GGTCAAGTCA ATGGAGGTTG GGACTATGAT GATCCTGACA ATGATTGGCC GCTAATTGCA
CCCGATGCTA ACGGCAACTT TCAATCCCCA ATCAATATCA CGGAAACACC TCCCATTGAT
TTAAGCGAAT ATATCACCAT TAACTATTCT GGGCAAAACC TAGATGAAGT TAAAAACTCC
GGCTATAACT TTGATGTGGA GTACTTTCCG AGTAATTTCA ATACCGTCGA TGTAGCCGGG
GAAACTTTTG AACTGTTGCA GTTCCATTTC CACTACGAAA GTGAACACGC CATTGATGGA
CTACTGTCGG ATATGGAGTT ACACTTCGTT AACCGTCATG AGGATGGAGG ACTGTCTGTT
CTTGGGTTAT TAGTCGAAGA AGGTAGTGAA AATTTGCCAT TAAAACCGCT TTTTGATGCC
ATTGACGCTC AACTAGATGC CAATGGAAGC TTGCCATCGA CTTTCACCTT ACCTCAAAAC
TTAAATATTG CCAGTATCTT CCCCAATAAT TTTGATGGTT GGTTCTACAA TGGCTCCTTA
ACCACACCCC CGGCAACCGA AGGTGTTAAC TGGTTTGTTT TTGAAACCCC CATTCAACTG
TCTACCGCAC AAATCGACAT TTTTCAAAAT TTTCTTAGCA GCATTGGTTT TACTCACAAC
AATCGACCAT TGCAAGATTT GAATGGAAGA CAGTTAAATG AACATACGCA TCAAGAAACT
CTGAATGGAG GTTCAATTTC TCAGCTTAAC TTTGGGAATG CTTTGGATTT AGGTCTTTTC
AGGTTCGCTC AGCTTAATTA TCAAGTTACC GTCAATGAAA ATGATGTTGT CGATCTGAAT
TTTGGTAGTA CTAACACGAC CATTCCTGAA CCCTCTTCTG TAATTGCCTT ATTCGGTTTG
TCTGGCGTTG GTTTACTTTC TCGGTTGAGA CAGAAGAAAG TTGAACGCTA A
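
The GC content listed under Gene Information can be recomputed from the gene sequence. The helper below is a minimal sketch, not the database's own method; `gc_content` is a hypothetical name, and the function simply ignores the spaces and line breaks used to group the bases in blocks of ten. Only the first row of the sequence is used here to keep the example short; pasting all rows would give the whole-gene figure to compare against the reported 42%.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied as shown (spaces included).
first_row = "ATGAATACTT CTTTATCTAG GATATATAAC CCCCTAGGGC TGGTTAGTGC TGCCGCTCTC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```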
 
Protein sequence
MNTSLSRIYN PLGLVSAAAL TFCSLVAGLA TPALAQLGSI SGTKFNDLNQ NGIREPLELG 
LPGWQIQLIN FDGDVIATTT TNLFGNYEFT GLAPGPYVVR EVMQPGWKQT LPNFIESMQL
GQVNGGWDYD DPDNDWPLIA PDANGNFQSP INITETPPID LSEYITINYS GQNLDEVKNS
GYNFDVEYFP SNFNTVDVAG ETFELLQFHF HYESEHAIDG LLSDMELHFV NRHEDGGLSV
LGLLVEEGSE NLPLKPLFDA IDAQLDANGS LPSTFTLPQN LNIASIFPNN FDGWFYNGSL
TTPPATEGVN WFVFETPIQL STAQIDIFQN FLSSIGFTHN NRPLQDLNGR QLNEHTHQET
LNGGSISQLN FGNALDLGLF RFAQLNYQVT VNENDVVDLN FGSTNTTIPE PSSVIALFGL
SGVGLLSRLR QKKVER
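
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The sketch below is a simplified illustration under stated assumptions: it uses the codon assignments that table 11 shares with the standard code, stops at the first stop codon, and does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI base order T, C, A, G ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence; it should match the first 20 residues
# of the protein sequence.
first_row = "ATGAATACTT CTTTATCTAG GATATATAAC CCCCTAGGGC TGGTTAGTGC TGCCGCTCTC"
print(translate(first_row))  # MNTSLSRIYNPLGLVSAAAL
```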