Gene Synpcc7942_2035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2035 
Symbol 
ID3774254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2104481 
End bp2105518 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content58% 
IMG OID637800480 
Producthypothetical protein 
Protein accessionYP_401052 
Protein GI81300844 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCTCA AGAACTACTG GTATCCCGTG GCGCTGGCTC AGAAGGTCGG CGATCGCCCC 
CTGTCTGTCA CTCTCTGCGG TGAAGCGATC GCCCTCTACC GCGACAGTGC GGGTCAAATT
CACGCCTTGA GCGATCGCTG TGTGCATCGG GGTGCAGCAC TTTCCGGCGG CTGGGTTGAA
AATGACTGTC TCGTCTGTCC GTACCACGGT TGGCAGTACG ACGCCCAAGG GCACTGCCGC
AAAATTCCTG CCAATACGGA GCAACAGCGC ATTCCCTTTG CAGCCAAAGT TCCCCACTAT
GATGCGATCG AACGCTACGG CCTAGTCTGG CTGTTCTACG GCGATCTACC TGAAGCGGAT
CGTCCACCCT TGCCGCCCTT GCCGGAATAC GACGATCCAG CCTGGCGCAC CGTGCAGGGT
GAAGTGACCT ACACCACCCA CTACACCCGC GTCACCGAAA ACCTGATGGA TTTCGCCCAT
GCACCCTTCA CCCACTCGGG TTCGTTTGGG GCAGCGTCCG ATCCATTAAT TGAGCCTTAC
AAAGTCGAAC AACTTCCAGA CGGTCTACGC GCCCAAACCC AGTTCACCAA ATCGGCCTAT
CGCGGCATTT GGAAGCTGTT CAATCGTGGC GATGCCCCAC GTACCGTCAC CACCACAATC
ACCCTTTACA TGCCCTGCAT CGTCCGCACC GAAACGGACT TAGGCAACGG CTTCCGCTTC
ATTGGCTACG GTGCCAATCT GCCGATCGAT GCCGAGACCA CCAAGACCTT TTGGCTGACC
GTGCGCACCT TCTTTACCGG TGCTTGGGCG GATGGCGACA CGGTGCGCCG CAGTCTCAAA
ATCATCGAAG AGGACAAACG GATCGTCGAA ACCCAGCGTC CCAAGATGAT TCCCTTGGAC
GATCGCAGTG AAACCCACGT CGCCGCTGAT GCCCTACAAA TCGGCTACCG CAACTTGCTG
CGACAAGCCC GCGATCGCGG TTGGGCAATT GCCGAATCTC AGCCAGCCGA TCAGGAACTG
GTGCCCGCCG CTAAGTAG
 
Protein sequence
MFLKNYWYPV ALAQKVGDRP LSVTLCGEAI ALYRDSAGQI HALSDRCVHR GAALSGGWVE 
NDCLVCPYHG WQYDAQGHCR KIPANTEQQR IPFAAKVPHY DAIERYGLVW LFYGDLPEAD
RPPLPPLPEY DDPAWRTVQG EVTYTTHYTR VTENLMDFAH APFTHSGSFG AASDPLIEPY
KVEQLPDGLR AQTQFTKSAY RGIWKLFNRG DAPRTVTTTI TLYMPCIVRT ETDLGNGFRF
IGYGANLPID AETTKTFWLT VRTFFTGAWA DGDTVRRSLK IIEEDKRIVE TQRPKMIPLD
DRSETHVAAD ALQIGYRNLL RQARDRGWAI AESQPADQEL VPAAK