Gene Synpcc7942_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2034 
Symbol 
ID3774253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2103270 
End bp2104508 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content60% 
IMG OID637800479 
Producthypothetical protein 
Protein accessionYP_401051 
Protein GI81300843 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.906883 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATTG TCATTCTGAC GATTGGGACG CGCGGTGACG TCCAACCGTT TATGGCGCTG 
GGTTTGGGGC TGAAAGCGGC GGGCTATGAG GTGGCGATCG CCACGCAAGC CAACTATCAG
TCGATGGTGG AAGGGCTGGG GCTGGAGTTT CGGTTGCTGG CGGGTGATCC CCAAGGGGTG
CAGCAGCAAT CAGGCGCTTA CTCCAAGGAA ACGGTCGCGG CGGCAGCGCA GCTGCTAGGC
CAGATTCTCA AGGATTCTTG GGCGGCTTGT CAGGATGCGG ATGCGATCGT GGCTTCGCCG
AATGCGCGGG GTGCGACTCA TATTGCCGAA GCGCTGAAGA TTCCTTGCTT TCTGGGATCG
CCCACGCCCT ACGGGTTTAC CCAAGCCTTT GCGAGCCCTT GGTTTCCGCC GAACTTCATG
CTGGGAGGTG GCTGGGGCAA TTGGCTCAGT CACTATGCCG TCGATAAATT GCTCTGGGTG
GCGACTCGCA AGACGGTCAA CGAGTGGCGC ATTTCTGATC TAGGACTGAA GCCCTTGAGT
TGGAGCAGTC CTTACAAACA GCTGGTGCGC AGAGGGCAAG TCTTCTTGCA TCCACTCAGT
GAAGTGACCT TGCCGAAACC TGCAGACTGG CCAGAGCAAG CGCATCTGAC GGGTTATTGG
CTGCTACCGG AAGCTGAGGC AACGCTCTCA CCCGAACTGG AAGCCTTTCT AGCAGCGGGT
GAGCCGCCGG TGTTCATTGG CTTTGGCAGC ATGGTCGACC AAGAACCGGA GCGGTTGACC
GCGATCGCAG TCGAAGCGCT GCAGAAAAGT AATCAGCGGG GCATTTTGCT AGCAGGCTGG
AGCCGGATCG ACCGCTCTCA GCTACCAGAC ACGGTGTTTC CACTAGAGTC CGCGCCCTTT
GGCCTGCTGT TTCCGCGCCT CGCAGCGGCA GTGCATCACG GTGGTTGTGG TACCACGGCA
GCGAGTTTGC AGGCAGGGTT GCCAACAATC ATCACGGCCT ACGGCAATGA CCAAGCCTTT
TGGGGCAAGC GGGTCGCAGA ACTAGGGGCA GGGCCATCCC CAATTACCCG CGAGGGTTTG
ACGGCTGAGA CTCTGGCGAC TGCGATCGCC CAAGCCGTCA GCGATCCGCA AATGCGATCG
CGGGCGCAGG CGATCGGGGA ACGGCTACGG GCAGAGAATG GGGTTTCTAA AGCAGTGAAA
CTGCTGGGTG ACTACTTAGC GGCGGGCACC AGTTCCTGA
 
Protein sequence
MRIVILTIGT RGDVQPFMAL GLGLKAAGYE VAIATQANYQ SMVEGLGLEF RLLAGDPQGV 
QQQSGAYSKE TVAAAAQLLG QILKDSWAAC QDADAIVASP NARGATHIAE ALKIPCFLGS
PTPYGFTQAF ASPWFPPNFM LGGGWGNWLS HYAVDKLLWV ATRKTVNEWR ISDLGLKPLS
WSSPYKQLVR RGQVFLHPLS EVTLPKPADW PEQAHLTGYW LLPEAEATLS PELEAFLAAG
EPPVFIGFGS MVDQEPERLT AIAVEALQKS NQRGILLAGW SRIDRSQLPD TVFPLESAPF
GLLFPRLAAA VHHGGCGTTA ASLQAGLPTI ITAYGNDQAF WGKRVAELGA GPSPITREGL
TAETLATAIA QAVSDPQMRS RAQAIGERLR AENGVSKAVK LLGDYLAAGT SS