Gene Synpcc7942_1738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1738 
Symbol 
ID3775438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1808413 
End bp1809672 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content57% 
IMG OID637800177 
Productcysteine desulfurase 
Protein accessionYP_400755 
Protein GI81300547 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.948748 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCGA TCGCCCTGAC CAATCTTGCG ACTGAGGTTC GTGCCGACTT TCCGATCCTG 
CAGCAGCAGG TCAACGGTCA GCCCTTGGTC TATCTCGACA ATGCGGCCAC TTCGCAAAAA
CCGAGAGCGG TGATCCAGTC CCTCGTCGAT TACTACGAGG GCTACAACAG CAACGTCCAT
CGCGGCGTCC ACACCCTGAG CGGCAAGGCA ACGGATGCCT ACGAAGGGGC ACGGCAAAAG
GTGGCGCGGT TCATCAACGC CAAGACGGAA CAGGAGATTG TCTACACCCG CAATGCCAGC
GAGGCGATCA ACCTCGTCGC CTACAGCTTC GGCATGAACT TTCTCCAAGC CGGTGATGAG
ATCATCCTCT CGGCGATGGA GCACCACAGC AACCTAATCC CTTGGCAGTT TGTGGCAGCG
AAAACAGGAG CGGCGCTGAA ATTCGTCGGG GTGACTGAGA CCGGTCAGTT CGACCTCGAG
CAGTTCCGTA GCCTCCTCAG CGATCGCACC AAACTGGTGT CGGTCGTCCA TGTTTCTAAT
ACGCTAGGTT GCTGCAATCC GGTCACGGAA ATTTGTCAGC TCGCCCATGC CAAGGGTGCG
CGGGTGTTGA TTGATGCCTG CCAAAGTCTT CCCCACCAGG CGATCGATGT TCAGGCGATC
GATTGCGATT GGCTGGTTGG CTCTGGCCAC AAAATGTGTG CACCGACGGG CATTGGTTTC
CTCTACGGCA AGCTCGACCT GTTGCGTCAA ATGCCGCCCT TCCTCGGCGG TGGTGAAATG
ATCGCCGATG TCTTCCTTGA TCACGCAACT TACGCCGATC TTCCTCACAA ATTCGAAGCA
GGAACACCGG CAATTGGAGA AGCGATCGCA TTGGGCGCGG CGATTGATTA TCTAACCGCG
ATCGGCATGG ATCGCATTCA CGCCTACGAA CAGCAGCTAA CCCAACACCT CTTCCAACGG
CTGGCAGAAA TTCCTGAGCT GACCGTCTAC GGACCTACGC CGGAGCAAGA TCGCGATCGC
GCTGCCCTCG CCGCCTTTAC CGCCGGTGCA GTCCATCCCC ACGATCTCTC GACCATCCTC
GACCAGTCAG GCATTGCGAT TCGAGCCGGG CATCACTGCA CTCAGCCACT GCACCGTGAA
TTACAAGTCC AATCAACGGC GCGGGCTAGT TGTTATTTCT ACAATACGAC TGACGAGATC
GATCGTCTGA TCGAGTCTCT CAAGGAAGCC GTTGAGTTCT TTGGAGCGAT TTTCAGCTAG
 
Protein sequence
MTAIALTNLA TEVRADFPIL QQQVNGQPLV YLDNAATSQK PRAVIQSLVD YYEGYNSNVH 
RGVHTLSGKA TDAYEGARQK VARFINAKTE QEIVYTRNAS EAINLVAYSF GMNFLQAGDE
IILSAMEHHS NLIPWQFVAA KTGAALKFVG VTETGQFDLE QFRSLLSDRT KLVSVVHVSN
TLGCCNPVTE ICQLAHAKGA RVLIDACQSL PHQAIDVQAI DCDWLVGSGH KMCAPTGIGF
LYGKLDLLRQ MPPFLGGGEM IADVFLDHAT YADLPHKFEA GTPAIGEAIA LGAAIDYLTA
IGMDRIHAYE QQLTQHLFQR LAEIPELTVY GPTPEQDRDR AALAAFTAGA VHPHDLSTIL
DQSGIAIRAG HHCTQPLHRE LQVQSTARAS CYFYNTTDEI DRLIESLKEA VEFFGAIFS