Gene Synpcc7942_1929 details

Gene Information

Locus tag: Synpcc7942_1929
Symbol:
ID: 3775292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 2003133
End bp: 2004293
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 62%
IMG OID: 637800371
Product: cysteine desulfurase
Protein accession: YP_400946
Protein GI: 81300738
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID:
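
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2003133
end_bp = 2004293

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1161 bp
print(protein_length, "aa")  # 386 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.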


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.685974
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAACCT ACTTGGACTA CAGCGCCACG ACCTCGATGC GGCCTGAGGT TTTGGAGCGT 
TTCACGGCCG TTGCCCAAGA GCAGTGGGGC AATGCAGCTA GCTTGCACCA GTGGGGTAAT
CGCGCAGCCC TCGTGCTAGA GCGATCGCGC CAGCAGGTCG CGGCCCTGAT CCAAGCGGAA
CCGGAGGCAA TCGCCTTTAG CTCCGGCGGT ACGGAATCGG ATAACTGGGC AATTCTCAGT
CCCTATCTTG CAGACCCGCG GCCGGGGCAT CTGATCATCT CCGCCGTCGA ACATTCCGCG
ATCGCTCGGC CAGCCGCTTG GCTAGAGCAA CGGGGCTGGC AGGTAACGCG CTTGCCGGTT
GATCGTAGCG GCCGCATTCA ACCCGCGGAT CTGGCCAGTG CGGTTCGCCC AGACACCCGC
CTGATCTCAG TGATCTGGGG TCAAAGCGAA GTCGGAACGA TCCAACCGAT CGCAGAGCTT
GCCGCGATCG CTCGTGAACA TGGCATCCTC TTCCACACCG ACGCGGTCCA AGTAGCGGGA
CGTTTACCGA TCGATGTGCA GCGGCTGCCG ATCGATTTGC TCTCGCTCTC CAGCCATAAA
CTCTACGGTC CCCAAGGAGT TGGGGCACTC TACATTCGGC CAGGTGTTGA GTTAGCGCCC
CTCCTGCAGG GTGGGAATCA AGAAAGTGGC CTGCGCTCGG GAACGCCACC GATTGCTGCG
ATCGCGGCCT TTGGTGAAGC CGCCCAGCTT GCCGCCGCCG AACTCCCCCA CGAGACGGCG
CGCCTCCAAT CCCTGCGCGA TCGCCTCATT GCGGTACTGG CCACCGAACC CCGTCTCAGG
CTGACAGGTG ACCCCATCCA GCGGCTGCCC CATCATGCCA GTTTTATCGC CCGAGGTGGC
ACAACTGGGA CCAGTCAACA GCTGGTACGA GCCATGAATC GGCTGGGTTT TGGCATCAGT
GGCGGTTCTG CTTGCAACAG TGGCCGCAGC CAGCCCAGTC CCGTCCTGCT AGCCATGGGG
TATAGTCCCC AAGAAGCCTT GGCAGGTATT CGTTTCAGCC TGGGTCGATC GACCCAGTTG
GCTGAGGTAG AAGCGGCGGC GATCGCCCTG CGATCGGCGC TCCACAGCTT GCCCCAAGCC
TCGTTGTTGT CTCCGGCCTA A
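
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before it can be used for any computation. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGTCAACCT ACTTGGACTA CAGCGCCACG ACCTCGATGC GGCCTGAGGT TTTGGAGCGT "
seq = clean_sequence(first_line)

print(len(seq))
print(round(gc_fraction(seq), 2))
```

To check the whole gene, pass all lines of the gene sequence to `clean_sequence` as a single string; the cleaned length can then be compared with the Gene Length field.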
 
Protein sequence
MSTYLDYSAT TSMRPEVLER FTAVAQEQWG NAASLHQWGN RAALVLERSR QQVAALIQAE 
PEAIAFSSGG TESDNWAILS PYLADPRPGH LIISAVEHSA IARPAAWLEQ RGWQVTRLPV
DRSGRIQPAD LASAVRPDTR LISVIWGQSE VGTIQPIAEL AAIAREHGIL FHTDAVQVAG
RLPIDVQRLP IDLLSLSSHK LYGPQGVGAL YIRPGVELAP LLQGGNQESG LRSGTPPIAA
IAAFGEAAQL AAAELPHETA RLQSLRDRLI AVLATEPRLR LTGDPIQRLP HHASFIARGG
TTGTSQQLVR AMNRLGFGIS GGSACNSGRS QPSPVLLAMG YSPQEALAGI RFSLGRSTQL
AEVEAAAIAL RSALHSLPQA SLLSPA