Gene Information |
Locus tag | Synpcc7942_1423 |
Symbol | |
ID | 3773595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 1476059 |
End bp | 1477678 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637799855 |
Product | carbonate dehydratase |
Protein accession | YP_400440 |
Protein GI | 81300232 |
COG category | [R] General function prediction only |
COG ID | [COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily |
TIGRFAM ID | |
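The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
length = gene_length(1476059, 1477678)
print(length)                  # 1620, matching "Gene Length"
print(protein_length(length))  # 539, matching "Protein Length"
```

Under these assumptions the 1620 bp gene holds 540 codons, the last of which is the stop codon, giving the listed 539 aa.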
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0536708 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCGAGCC CAACAACGGT CCCCGTTGCT ACGGCGGGTC GGTTGGCTGA GCCTTATATT GATCCGGCTG CTCAGGTTCA TGCGATCGCC AGCATCATCG GCGACGTACG TATCGCAGCG GGAGTCCGCG TTGCAGCGGG GGTTTCGATC CGTGCTGACG AAGGCGCACC ATTCCAAGTC GGGAAAGAAA GCATCCTGCA AGAGGGCGCT GTCATCCACG GCTTGGAATA TGGTCGTGTC TTGGGCGATG ACCAAGCGGA CTATTCCGTC TGGATAGGCC AGCGAGTCGC GATTACTCAC AAAGCACTCA TCCATGGCCC GGCCTATCTC GGAGATGACT GCTTCGTCGG TTTCCGATCC ACCGTCTTCA ACGCTCGTGT TGGGGCCGGT TCGGTAATCA TGATGCACGC CCTTGTCCAA GACGTAGAGA TTCCTCCCGG TCGCTATGTT CCTTCTGGAG CAATCATCAC GACCCAGCAG CAGGCCGATC GCCTACCCGA GGTTCGCCCG GAAGATCGGG AATTTGCCCG CCACATCATT GGCTCACCTC CAGTGATTGT CCGGTCTACT CCAGCAGCTA CTGCTGATTT CCACTCCACG CCAACTCCTT CTCCACTTCG TCCATCGTCT AGCGAGGCAA CGACCGTGAG CGCTTATAAC GGCCAAGGCC GACTCAGTTC CGAAGTCATC ACCCAAGTCC GGAGTTTGCT GAACCAGGGC TATCGGATTG GGACGGAACA TGCGGACAAG CGCCGCTTCC GGACTAGCTC TTGGCAGCCC TGCGCGCCGA TTCAAAGCAC GAACGAGCGC CAGGTCTTGA GCGAACTGGA AAATTGTCTG AGCGAACACG AAGGTGAATA CGTTCGCTTG CTCGGCATCG ATACCAATAC TCGCAGCCGT GTTTTTGAAG CCCTGATTCA ACGGCCCGAT GGTTCGGTTC CTGAATCGCT GGGGAGCCAA CCGGTGGCAG TCGCTTCCGG TGGTGGCCGT CAGAGCAGCT ATGCCAGCGT CAGCGGCAAC CTCTCAGCAG AAGTGGTCAA TAAAGTCCGC AACCTCTTAG CCCAAGGCTA TCGGATTGGG ACGGAACATG CAGACAAGCG CCGCTTTCGG ACTAGCTCTT GGCAGTCCTG CGCACCGATT CAAAGTTCGA ATGAGCGCCA GGTTCTGGCT GAACTGGAAA ACTGTCTGAG CGAGCACGAA GGTGAGTACG TTCGCCTGCT GGGCATCGAC ACTGCTAGCC GCAGTCGTGT TTTTGAAGCC CTGATCCAAG ATCCCCAAGG ACCGGTGGGT TCCGCCAAAG CGGCCGCCGC ACCTGTGAGT TCGGCAACGC CCAGCAGCCA CAGCTACACC TCAAATGGAT CGAGTTCGAG CGATGTCGCT GGACAGGTTC GGGGTCTGCT AGCCCAAGGC TACCGGATCA GTGCGGAAGT CGCCGATAAG CGTCGCTTCC AAACCAGCTC TTGGCAGAGT TTGCCGGCTC TGAGTGGCCA GAGCGAAGCA ACTGTCTTGC CTGCTTTGGA GTCAATTCTG CAAGAGCACA AGGGTAAGTA TGTGCGCCTG ATTGGGATTG ACCCTGCGGC TCGTCGTCGC GTGGCTGAAC TGTTGATTCA AAAGCCGTAA
Protein sequence | MPSPTTVPVA TAGRLAEPYI DPAAQVHAIA SIIGDVRIAA GVRVAAGVSI RADEGAPFQV GKESILQEGA VIHGLEYGRV LGDDQADYSV WIGQRVAITH KALIHGPAYL GDDCFVGFRS TVFNARVGAG SVIMMHALVQ DVEIPPGRYV PSGAIITTQQ QADRLPEVRP EDREFARHII GSPPVIVRST PAATADFHST PTPSPLRPSS SEATTVSAYN GQGRLSSEVI TQVRSLLNQG YRIGTEHADK RRFRTSSWQP CAPIQSTNER QVLSELENCL SEHEGEYVRL LGIDTNTRSR VFEALIQRPD GSVPESLGSQ PVAVASGGGR QSSYASVSGN LSAEVVNKVR NLLAQGYRIG TEHADKRRFR TSSWQSCAPI QSSNERQVLA ELENCLSEHE GEYVRLLGID TASRSRVFEA LIQDPQGPVG SAKAAAAPVS SATPSSHSYT SNGSSSSDVA GQVRGLLAQG YRISAEVADK RRFQTSSWQS LPALSGQSEA TVLPALESIL QEHKGKYVRL IGIDPAARRR VAELLIQKP
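The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips the spacing and translates DNA codon by codon. It is not the database's own pipeline: `clean`, `translate`, and `gc_percent` are hypothetical helper names, and the codon table is built from the standard amino acid assignments, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may initiate translation; this gene starts with ATG, so that difference does not matter here).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record.
print(translate("ATGCCGAGCC CAACAACGGT CCCCGTTGCT"))  # MPSPTTVPVA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be checked in the same way.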