Gene Synpcc7942_1799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1799 
Symbol 
ID3774374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1868940 
End bp1869971 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content56% 
IMG OID637800240 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_400816 
Protein GI81300608 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0201119 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTGGG ACTGTCCGCT ACCTCTGACC GGCAGCGAAC GCATTCAGAT GGCCCACGGT 
AGCGGGGGTC GCCAAATGAA TCAGCTGATT GAGACAGTTT TTCGATCAGT TTTTGAGACA
GAGCAAACGG TTGCAGCGGA TGCTGCTGTC CTAACTTTAC CGAGCGATCG CATCGCCGTC
AGCACGGATA GTTTTGTCAT CCAACCACTG TTTTTCCCCG GTGGCGACAT CGGCTCTCTG
GCCGTGCATG GCACAGTCAA TGATTTATTG ATGCGGGGCG CAATTCCTCA CCGGTTGACC
GCCAGCTTCA TCCTCGAAGA AAGCTTGGCG ATCGCAACGC TCAACACACT GGTGCAGTCG
ATGGCTAGCG CAGCCCAAGC CGCTGGGGTT CAGATTTGCG CCGGCGATAC CAAAGTGGTT
GAGCGCGGGA AGGCTGATGG CCTTTACATC ACGACAACGG GCATTGGCTG GTTGCCGCCC
GATCGCCAGA TTGGTCCGCA ACAAATCCAG GTAGGCGACG CGATCGTGAT CAATGGCGAT
CTTGGGCGAC ATGGCATTGC CGTCATGGCC TGCCGCGAAG GGCTGGAGTT AGAGCTGCCC
TTCGCTAGTG ATGTCGCAAG TTTGCGATCG CCCGTCTTGG GTTTGCTGGA AGCAGGCATC
GATCTTCATT GTCTACGGGA CCTGACGCGA GGCGGCTTAG CCAGTGCCCT CAATGAATTA
GCAGCCGTCG GGGTTGGGAT GGATATCGAG GCTAAGGCGA TTCCAGTCGA TCCAGCAGTT
GCAGGTGTCT GTGAAATCCT CGGTTTAGAC CCACTACAGG TTGCCAATGA AGGCCGATTT
GTGGCGTTTC TACCCACCGA TCAAGTGGCG ATCGCGCTAC AAATCTTGCG CCAAATTTGG
CCAGAACAGG AACCTCGTTG CATTGGCATC GTCCAATCAG CTACGCCAGG TCTCGTGCGG
TTGCGTCAGC CCTTTGGCAG CGATCGCCTG TTGGAAATGC TCAGTGGCGA ACAGCTACCC
CGCATTTGCT AG
 
Protein sequence
MSWDCPLPLT GSERIQMAHG SGGRQMNQLI ETVFRSVFET EQTVAADAAV LTLPSDRIAV 
STDSFVIQPL FFPGGDIGSL AVHGTVNDLL MRGAIPHRLT ASFILEESLA IATLNTLVQS
MASAAQAAGV QICAGDTKVV ERGKADGLYI TTTGIGWLPP DRQIGPQQIQ VGDAIVINGD
LGRHGIAVMA CREGLELELP FASDVASLRS PVLGLLEAGI DLHCLRDLTR GGLASALNEL
AAVGVGMDIE AKAIPVDPAV AGVCEILGLD PLQVANEGRF VAFLPTDQVA IALQILRQIW
PEQEPRCIGI VQSATPGLVR LRQPFGSDRL LEMLSGEQLP RIC