Gene Syncc9605_2279 details

Gene Information

Locus tag: Syncc9605_2279
Symbol:
ID: 3736221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9605
Kingdom: Bacteria
Replicon accession: NC_007516
Strand:
Start bp: 2087612
End bp: 2088892
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 62%
IMG OID: 637776863
Product: cysteine desulfurase
Protein accession: YP_382568
Protein GI: 78213789
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
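
The coordinate and length fields above are related by simple arithmetic, which the short sketch below checks. It assumes the start and end positions are both inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above; function names are illustrative only.
START_BP = 2087612
END_BP = 2088892


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1281 426
```

Under those assumptions the results agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields of the record.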


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0879534
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.284037
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATCG CTGCTGAAGC ACGATCCAGT GCTGATTTTG TGGATTTATC GTCTCGATAT 
CGTGCCGATT TTCCGATTCT TAAGCAGCGT GCCCCCGACG GAAGGCCGCT GATTTATCTC
GATCACGCTG CCACCAGCCA GAAGCCCCGT CAGGTGCTCG AGGCGCTGCA GCACTACTAC
AGCTGCGACA ACGCCAACGT GCACCGCGGC GCTCACCAGC TCAGTGCCCG TGCCACCGAT
GCCTTCGAAG CGGCCCGCAG CACGACGGCG GCCTTCATTG ATGCCGCCAG TCCCCGGGAG
ATTGTCTTCA CCCGCAATGC AAGCGAAGCC ATCAATCTGG TGGCGCGCAC CTGGGGTGAC
GCGAACCTCA AGCAAGGGGA TGAAATTCTG CTCACGGTGA TGGAGCACCA CAGCAACCTG
GTGCCCTGGC AGCTGTTGGC CCAGCGCACC GGCTGCGTGC TGCGCCATGT GGGCATCACC
GATTCCGGCG AGCTGGATCT CGAGGATTTT CGGGCCCAGC TCAATGAGCG CACTCGGCTC
GTGAGCCTGG TGCACATCAG CAATTCCCTG GGTTGCTGCA ATCCCCTCGA TCAGGTCATC
CCCGCGGCCC ATGCCGTTGG CGCCTGCGTC CTCGTGGATG CCTGCCAGAG CCTGGCTCAC
AAGCCCATTG ATGTGGTGGC CCTTGATGCC GACTTCCTGG TCGGCTCGTC CCACAAGCTC
TGTGGTCCCA CCGGCATGGG ATTCCTCTGG GCGCGGGAGT CGCTGCTGGA GGCGATGCCT
CCCTTCCTGG GCGGCGGCGA GATGATTCAG GACGTTTTCC TGGACCACAG CACTTGGGCG
GTGTTGCCCC ATAAGTTCGA AGCGGGCACC CCTGCCATTG GTGAGGCGGT GGGCATGGGG
GCAGCGATTC GCTATCTGCA GATGGTGGGT CTGGAGGCGA TTCAGGCCTG GGAGGCGCAG
CTCACCAGGC ATCTGTTCGC TCGGCTGCAG GCCATTGATG GTGTGCGGGT CTTGGGCCCT
ACGCCGGATC AACAGCCCGA GCGTGGTGCC CTCGCCACAT TCCTGGTGGA TGGCGTGCAT
GCCAACGACA TCGCCGCTCT GATTGATGCC TCCGGGATCT GCATTCGCAG TGGTCACCAC
TGCTGTCAGC CCTTGCACCG CATCTATGAC GTGACAGCTT CAGCGCGGGC CAGCCTGAGT
TTCACCAGCA CCTTTGAAGA GATCGACCGC TTCAGCGAAG AGCTCGCTTC CACGGTCGCT
TTCTTGCGCG AGCACAGCTG A
 
Protein sequence
MTIAAEARSS ADFVDLSSRY RADFPILKQR APDGRPLIYL DHAATSQKPR QVLEALQHYY 
SCDNANVHRG AHQLSARATD AFEAARSTTA AFIDAASPRE IVFTRNASEA INLVARTWGD
ANLKQGDEIL LTVMEHHSNL VPWQLLAQRT GCVLRHVGIT DSGELDLEDF RAQLNERTRL
VSLVHISNSL GCCNPLDQVI PAAHAVGACV LVDACQSLAH KPIDVVALDA DFLVGSSHKL
CGPTGMGFLW ARESLLEAMP PFLGGGEMIQ DVFLDHSTWA VLPHKFEAGT PAIGEAVGMG
AAIRYLQMVG LEAIQAWEAQ LTRHLFARLQ AIDGVRVLGP TPDQQPERGA LATFLVDGVH
ANDIAALIDA SGICIRSGHH CCQPLHRIYD VTASARASLS FTSTFEEIDR FSEELASTVA
FLREHS