Gene Synpcc7942_1070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1070 
Symbol 
ID3774002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1081730 
End bp1082851 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content57% 
IMG OID637799494 
Productoxidoreductase aldo/keto reductase 
Protein accessionYP_400087 
Protein GI81299879 
COG category[R] General function prediction only 
COG ID[COG1453] Predicted oxidoreductases of the aldo/keto reductase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.587482 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00485834 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCACTATC GCCAATATGG TCGGACAGGG CGATTGATCT CGCGCTTTTC CCTCGGCTTG 
ATGCGCTGTT TGGACTCTGC CGCGCAATTG GAGGCTGTGC TTGATGCTGC GTGGCGGCTT
GGCATCAACC ACTTTGAGAC GGCGCAGTCC TACGGCCCCA GCGAAGCCTA TCTGGCGCAA
GCCCTGCATT CTCTGCAATT GCCCCGTGAT CAAGTCATCA TCACGACTAA AATCCTGCCC
GATCGCGACC CCCAGCAGAT GGTAGAAGCC GTGTTGCGAT CGCGTGATCG CTTAGGGATT
GACTCGATCG ATTGTCTTGC CCTGCATGGT CTCAACCAGC CCGAGCATTT GCAGCAGGCG
ATCGCTGCAT TACCGGCGCT CCAAACCCTG CAAGCGGAAG GCGTCTTTCA GCATTTGGGC
TTTTCCAGTC ACGGCGATCG CGAATTGATT CTGGAGGCGA TCGCTACCGA TGCGTTTGAC
TTTGTCAGCC TCCATTACTA CCTGCTGTTT CAACGTCACG CGCCGGTCAT TGAAGCAGCT
GCAGCCAAAA ATCTAGGAAT TTTCATCATT TCGCCCGTCG ATAAGGGTGG ACTCCTGCAC
CAACCTTCTG CCCAACTGAT CGAGGACTGT CAGCCCTTCA GTCCTCTGGC ACTCAACTAT
CGATTTCTGC TCAGCGATCG CCGGATTACA ACCCTCAGTT TTGGTGCTGC AAAGGCCGAG
GAATTAGCGG TTCTTCAGGA CTTCGTTGAT GCGGATCAGC CGCTGAGTCT GGAGGAAGCT
GAGGCGATCG CGCGACTGGA ACAAGTTCGC CAGCAGCGGC TGGGCAGGGA CTACTGTCAG
CAGTGTTATG CCTGTTTGCC CTGTCCCGAG GCGATCAACA TTCCTGAGGT ACTGCGGCTG
CGGAATCTGG CAATCGCCCA CGACATGCAA GCCTACGGAC GATATCGATA TCGCATGTTT
GAAAATGCCG GACATTGGTT CCCGGGGCAG CGAGGCAGCC GCTGCACGGA TTGTGGCGAT
TGCCTACCCC GTTGCCCCCA TCACTTGCCG ATCGCGGATT TGGTGCGCGA TGCTGATCAG
CGATTAGCAG GCGCTCCTCG GCGGCGTTTG TGGGGAGATT AG
 
Protein sequence
MHYRQYGRTG RLISRFSLGL MRCLDSAAQL EAVLDAAWRL GINHFETAQS YGPSEAYLAQ 
ALHSLQLPRD QVIITTKILP DRDPQQMVEA VLRSRDRLGI DSIDCLALHG LNQPEHLQQA
IAALPALQTL QAEGVFQHLG FSSHGDRELI LEAIATDAFD FVSLHYYLLF QRHAPVIEAA
AAKNLGIFII SPVDKGGLLH QPSAQLIEDC QPFSPLALNY RFLLSDRRIT TLSFGAAKAE
ELAVLQDFVD ADQPLSLEEA EAIARLEQVR QQRLGRDYCQ QCYACLPCPE AINIPEVLRL
RNLAIAHDMQ AYGRYRYRMF ENAGHWFPGQ RGSRCTDCGD CLPRCPHHLP IADLVRDADQ
RLAGAPRRRL WGD