Gene Synpcc7942_2036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2036 
Symbol 
ID3774255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2105522 
End bp2106526 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content56% 
IMG OID637800481 
Producthypothetical protein 
Protein accessionYP_401053 
Protein GI81300845 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.502892 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAATC TAGACGTCGC GCTGCTACCG AATCACTGGT ACGCGATCGC TGCCAGTACT 
GATCTGGGTT CCACACCGAT CGCAGCGTCG CTACTCGATC AACAGCTGGT GGTCTATCGC
ACTACTGCTG GACAAGTTGT GGTGCTCGAC GATCGCTGCC CCCACCGTGG GGCTTCGCTA
GCCTGCGGTC AGGTCAAGGG CAACGCGATT GCCTGTCCCT ATCACGGTTG GCAATTTGAT
CTCGATGGTC ATTGCGCTCA GATTCCATCT CAGCAGGCTT CGGCACGAAT TCCCCAAGCG
GCAAAAGTGG CGAGCTATCC TGTCCAGGAG CGCTATGGCT TGATTTGGGT GTTCACCGGC
GATCGCGATC GGGCGGCACA AACGCCGTTG TGGGAACTGC CGGAATATGA CCAAGCCGGT
TGGCGGGTGG TTCAAGGTCA GTTCGATTGG GCGGCAGACT ATCGTCGCGT TACCGAAAAT
GGCATGGATG TGGCGCATTC ACCCTTCGTG CATGCCAATT CCTTTGGTGC TAGCGGCAAT
GAAGTGATCG CCGATTTTGA GTTGGAAAAG AGCGATCTCG GCGCCCAAAT CTGGATTCCG
ATCGAGCCGA AGGCGAACTA TCGTGGCAGC TTCAACCTGC TAGGACGCAA GCAAGAAACC
CCCAAGGCAG GGCGATCGGG GGCAGCCTTT CACTTACCGA ACATCACTCG TATCGATATT
GAATTCGGCA ACTTTCATTT GATCTTGGTC GGTATTCACC AGCCGATCTC GGCCACAACG
ACCCGTAGTC ACTGGCTCCA TGTCCGCAAT TTCTTGACGG CAGGCTGGGC GGATGGTGGG
ACACGCAAAC GCACCGCCAA GCTTTTTCAG GAAGATCAAA AGATCATTGA GGGGATTGCT
CCTCTGCGCG ATCGCAATGA AATTTCGGTT GCCTCCGATC GCCTGCAACT CTACTACCGC
CAGCTCTGGC AACAGCACCA TTCCTCCCTC GTCGCCCAAG GCTGA
 
Protein sequence
MANLDVALLP NHWYAIAAST DLGSTPIAAS LLDQQLVVYR TTAGQVVVLD DRCPHRGASL 
ACGQVKGNAI ACPYHGWQFD LDGHCAQIPS QQASARIPQA AKVASYPVQE RYGLIWVFTG
DRDRAAQTPL WELPEYDQAG WRVVQGQFDW AADYRRVTEN GMDVAHSPFV HANSFGASGN
EVIADFELEK SDLGAQIWIP IEPKANYRGS FNLLGRKQET PKAGRSGAAF HLPNITRIDI
EFGNFHLILV GIHQPISATT TRSHWLHVRN FLTAGWADGG TRKRTAKLFQ EDQKIIEGIA
PLRDRNEISV ASDRLQLYYR QLWQQHHSSL VAQG