Gene Synpcc7942_0839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0839 
Symbol 
ID3774016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp833950 
End bp834954 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content56% 
IMG OID637799255 
ProductNitrilase 
Protein accessionYP_399857 
Protein GI81299649 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.29683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0736263 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATA AAATCATTGT GGCAGCTGCA CAAATCCGAC CTGTTCTATT CAGTTTGGAA 
GGATCTGTTG CTCGGGTTCT AGCGGCCATG GCAGAAGCCG CAGCAGCGGG CGTTCAGCTG
ATTGTTTTCC CTGAAACCTT TCTGCCCTAC TATCCTTATT TCTCCTTCGT CGAACCGCCG
GTTCTGATGG GGCGATCGCA CCTCAAGCTC TACGAACAAG CCTTCACAAT GACGGGGCCG
GAACTCCAGC AAATTGCCAG GGCTGCTCGA CAGCATCGTC TCTTTGTTTT GCTCGGCGTC
AATGAGCGAG ATGGCGGTAG TCTTTACAAC ACTCAGCTAT TGATTAGCGA TCAGGGTGAC
TTACTTCTGA AGCGCCGCAA AATCACCCCG ACCTATCACG AACGGATGGT CTGGGGACAA
GGCGGTGGCG CGGGCCTAAC CGTTGTCGAA ACGGTGCTGG GTAAGGTTGG GGCCTTGGCT
TGCTGGGAGC ACTACAACCC CTTAGCCCGC TTCAGTCTGA TGACTCAGGG GGAGGAAATT
CACTGCGCCC AATTCCCAGG ATCGCTGGTG GGGCCAATTT TTAGCGAGCA AACCGCGGTC
ACACTGCGTC ACCATGCCCT CGAAGCCGGT TGCTTTGTGC TCAGCTCGAC GGCTTGGTTG
GATCCGGCCG ACTACGACAC AATCACTCCC GATCGTAGTT TGCACAAAGC CTTTCAAGGG
GGCTGTCACA CCGCGATCAT CAGTCCCGAA GGGCGCTATC TCGCAGGGCC ACTGCCCGAG
GGCGAAGGAC TCGCGATCGC TGAACTCGAC AAAAGTTTGA TCACCAAACG TAAGCGAATG
ATGGACAGCG TCGGTCACTA TTCGCGCCCC GATTTACTCA GCCTTCGGAT CAATCGCAGC
CCTGCCACAC AAGTGCAAGC GATCGGTTCA GCTGCTGCCT TGCCAGAACT ACCGAACCTA
GAAGCCGCAC CGGCTGAGAC TGCGGAGGAC TATCTCCATG CCTAA
 
Protein sequence
MADKIIVAAA QIRPVLFSLE GSVARVLAAM AEAAAAGVQL IVFPETFLPY YPYFSFVEPP 
VLMGRSHLKL YEQAFTMTGP ELQQIARAAR QHRLFVLLGV NERDGGSLYN TQLLISDQGD
LLLKRRKITP TYHERMVWGQ GGGAGLTVVE TVLGKVGALA CWEHYNPLAR FSLMTQGEEI
HCAQFPGSLV GPIFSEQTAV TLRHHALEAG CFVLSSTAWL DPADYDTITP DRSLHKAFQG
GCHTAIISPE GRYLAGPLPE GEGLAIAELD KSLITKRKRM MDSVGHYSRP DLLSLRINRS
PATQVQAIGS AAALPELPNL EAAPAETAED YLHA