Gene Synpcc7942_1423 details

Gene Information

Locus tag: Synpcc7942_1423
Symbol:
ID: 3773595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1476059
End bp: 1477678
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 58%
IMG OID: 637799855
Product: carbonate dehydratase
Protein accession: YP_400440
Protein GI: 81300232
COG category: [R] General function prediction only
COG ID: [COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily
TIGRFAM ID:
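
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes two things the record does not state: that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1476059
END_BP = 1477678
GENE_LENGTH_BP = 1620
PROTEIN_LENGTH_AA = 539


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```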


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0536708
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGAGCC CAACAACGGT CCCCGTTGCT ACGGCGGGTC GGTTGGCTGA GCCTTATATT 
GATCCGGCTG CTCAGGTTCA TGCGATCGCC AGCATCATCG GCGACGTACG TATCGCAGCG
GGAGTCCGCG TTGCAGCGGG GGTTTCGATC CGTGCTGACG AAGGCGCACC ATTCCAAGTC
GGGAAAGAAA GCATCCTGCA AGAGGGCGCT GTCATCCACG GCTTGGAATA TGGTCGTGTC
TTGGGCGATG ACCAAGCGGA CTATTCCGTC TGGATAGGCC AGCGAGTCGC GATTACTCAC
AAAGCACTCA TCCATGGCCC GGCCTATCTC GGAGATGACT GCTTCGTCGG TTTCCGATCC
ACCGTCTTCA ACGCTCGTGT TGGGGCCGGT TCGGTAATCA TGATGCACGC CCTTGTCCAA
GACGTAGAGA TTCCTCCCGG TCGCTATGTT CCTTCTGGAG CAATCATCAC GACCCAGCAG
CAGGCCGATC GCCTACCCGA GGTTCGCCCG GAAGATCGGG AATTTGCCCG CCACATCATT
GGCTCACCTC CAGTGATTGT CCGGTCTACT CCAGCAGCTA CTGCTGATTT CCACTCCACG
CCAACTCCTT CTCCACTTCG TCCATCGTCT AGCGAGGCAA CGACCGTGAG CGCTTATAAC
GGCCAAGGCC GACTCAGTTC CGAAGTCATC ACCCAAGTCC GGAGTTTGCT GAACCAGGGC
TATCGGATTG GGACGGAACA TGCGGACAAG CGCCGCTTCC GGACTAGCTC TTGGCAGCCC
TGCGCGCCGA TTCAAAGCAC GAACGAGCGC CAGGTCTTGA GCGAACTGGA AAATTGTCTG
AGCGAACACG AAGGTGAATA CGTTCGCTTG CTCGGCATCG ATACCAATAC TCGCAGCCGT
GTTTTTGAAG CCCTGATTCA ACGGCCCGAT GGTTCGGTTC CTGAATCGCT GGGGAGCCAA
CCGGTGGCAG TCGCTTCCGG TGGTGGCCGT CAGAGCAGCT ATGCCAGCGT CAGCGGCAAC
CTCTCAGCAG AAGTGGTCAA TAAAGTCCGC AACCTCTTAG CCCAAGGCTA TCGGATTGGG
ACGGAACATG CAGACAAGCG CCGCTTTCGG ACTAGCTCTT GGCAGTCCTG CGCACCGATT
CAAAGTTCGA ATGAGCGCCA GGTTCTGGCT GAACTGGAAA ACTGTCTGAG CGAGCACGAA
GGTGAGTACG TTCGCCTGCT GGGCATCGAC ACTGCTAGCC GCAGTCGTGT TTTTGAAGCC
CTGATCCAAG ATCCCCAAGG ACCGGTGGGT TCCGCCAAAG CGGCCGCCGC ACCTGTGAGT
TCGGCAACGC CCAGCAGCCA CAGCTACACC TCAAATGGAT CGAGTTCGAG CGATGTCGCT
GGACAGGTTC GGGGTCTGCT AGCCCAAGGC TACCGGATCA GTGCGGAAGT CGCCGATAAG
CGTCGCTTCC AAACCAGCTC TTGGCAGAGT TTGCCGGCTC TGAGTGGCCA GAGCGAAGCA
ACTGTCTTGC CTGCTTTGGA GTCAATTCTG CAAGAGCACA AGGGTAAGTA TGTGCGCCTG
ATTGGGATTG ACCCTGCGGC TCGTCGTCGC GTGGCTGAAC TGTTGATTCA AAAGCCGTAA
 
Protein sequence
MPSPTTVPVA TAGRLAEPYI DPAAQVHAIA SIIGDVRIAA GVRVAAGVSI RADEGAPFQV 
GKESILQEGA VIHGLEYGRV LGDDQADYSV WIGQRVAITH KALIHGPAYL GDDCFVGFRS
TVFNARVGAG SVIMMHALVQ DVEIPPGRYV PSGAIITTQQ QADRLPEVRP EDREFARHII
GSPPVIVRST PAATADFHST PTPSPLRPSS SEATTVSAYN GQGRLSSEVI TQVRSLLNQG
YRIGTEHADK RRFRTSSWQP CAPIQSTNER QVLSELENCL SEHEGEYVRL LGIDTNTRSR
VFEALIQRPD GSVPESLGSQ PVAVASGGGR QSSYASVSGN LSAEVVNKVR NLLAQGYRIG
TEHADKRRFR TSSWQSCAPI QSSNERQVLA ELENCLSEHE GEYVRLLGID TASRSRVFEA
LIQDPQGPVG SAKAAAAPVS SATPSSHSYT SNGSSSSDVA GQVRGLLAQG YRISAEVADK
RRFQTSSWQS LPALSGQSEA TVLPALESIL QEHKGKYVRL IGIDPAARRR VAELLIQKP
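
The relationship between the two sequences above can be reproduced with a short translation routine. This is a minimal sketch, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares; table 11 differs in which start codons it permits, and that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGCCGAGCC CAACAACGGT CCCCGTTGCT"))  # MPSPTTVPVA
```

Passing the full gene sequence to `translate` should give the protein sequence listed above with the spaces removed, since the final codon TAA is a stop.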