Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_B2661 |
Symbol | |
ID | 3772742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007595 |
Strand | - |
Start bp | 3160 |
End bp | 5040 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637798362 |
Product | cysteine desulfurase |
Protein accession | YP_398710 |
Protein GI | 81230380 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.0591977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.463261 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAATA CTGTTCCGTC CGTTCCGGCA GTGCCGAATC TGCCGACTCA ATCGGATCCC TTTTTCAATG AGCGATCGCT TGAACAACTG ACTCAAACAG TGTTGCAAGA TCTGCAGCAG GCTGGGGTTA GCGAAGCAGA ATCGGCTCCA ACGCCTCTAT CCGTTCCGAC CCCAGCACTC CCAACCACAT CAGCTTTAGC AGTCCCTCAA TCTCCTACTG CGATCGCGAA TGTCCCTGCT CCTCCGAGTT CTATCGATGA GCGATCGCTG GCTCAGTTGG CGCAAGCCGT TTTGCAAGAT CCGCAGTTGG CCAGTGCGAT CGCATCTATA TTCCCTTCAG TAACCCTTCC AACGTCAGCC TCAGTTCCTC GTTCAGTACC GGTGCCGCCG AGCTTTTTGC CGAGCCTCGT ACCCACAGCG CCACCGATTC ACGATGAAGT GGGCGTGATT CCGCATCATC AATTGCCAGT TCCCAGCCAG CCCACACCCG CAGGCTTGCA GCAGACTGCC TCAAGTAAGA GTGGCAGTGG TTTTTATTTC ATTGATGAAC AGGTCGAAAC CGCGATCGCG GCACTGCACA GCAATCTGAC CGTATTTCCA CAACTGACAA CGTCTTCAAT CCCAACTTTG ACCGGAGCTC ATTCAGCTGG AGCAGTTGGA TTTGATATTC ATCAAGTCCG GCGGGATTTT CCGATTTTGC AGGAGCGCGT CAATGGCCGT CCCCTAGTTT GGTTTGACAA TGCGGCGACG ACCCAGAAAC CTCAAGTCGT GATCGATCGC CTGTCGCACT ATTACCAACA CGAGAACTCC AATATTCACC GTGCGGCCCA TGAGCTGGCA GCGCGATCGA CGGATGCCTA TGAAGCGGCT CGTGAGCAAG TGCGGCATTT CCTTAATGCG GCCTCCACTG AGGAAGTCGT GTTTGTGCGG GGCACCACCG AGGCGATCAA TCTGGTTGCT AAAAGCTGGG GATCGCAGAA CCTCAAAGAA GGCGATGAAA TCGTCATTAC TTGGCTAGAG CACCATGCCA ACATCGTGCC TTGGCAACAG CTCAGTGCTG AGACAGGGGC CCGGCTGCGG GTCGTCCCTG TGGATGATTA TGGTCAAGTC CGCCTGGATG AATATCAAAA GTTGCTGAGC GATCGCACCA AGATCGTCTC ATTCACGCAG GTCTCCAATG CCCTCGGCAC AATTACGCCA GCCAAGGAAA TCATTGAACT GGCCCATCGT TACGGAGCGA AAGTGCTACT CGATGGCGCT CAGTCGGTCT CTCACTTGGC GGTCGATGTG CAAGCGCTGG ACTGCGACTG GTTCGTTTTC TCGGGCCACA AGGTCTTTGG CCCCACCGGA ATTGGCGTGC TCTATGGCAA ACAGGAGCTG CTTGATGCGA CGCTACCTTG GCAAAGTGGT GGCAACATGA TCGCCGATGT CACGTTTGAG AAAACGGTCT ATCAGCCGGC TCCGGCACGC TTTGAAGCTG GGACGGGCAA CATTGCTGAT GCTGTGGGTT TGGGAGCAGC GCTGGAGTAT GTCCAAAAGA TTGGGCTAGA GGCGATCGCT GCCTACGAGC ATGAGTTATT GGTTCATGGC ACTGCGCTGC TTAGTCAGAT TCCGGGATTA CGGCTGATCG GTACGGCTCC GCACAAGGCA GCAGTGCTGT CTTTTGTTCT CGAGGGCTTT AGTCCAGAGG CGATCGGTCA GGCATTGAAT CGAGAAGGGA TTGCAGTGCG GGCGGGGCAT CACTGCGCTC AGCCAATTCT GCGACGCTTC GGGCTGGAAA CAACGGTGCG GCCATCGCTG GCTTTTTACA ACACCTTCGA GGAGTTGGAG ACACTGGCAG CGGCGATTCG CCGGATTCAA ACGGGGAGCC TCGCCCTCTA A
|
Protein sequence | MTNTVPSVPA VPNLPTQSDP FFNERSLEQL TQTVLQDLQQ AGVSEAESAP TPLSVPTPAL PTTSALAVPQ SPTAIANVPA PPSSIDERSL AQLAQAVLQD PQLASAIASI FPSVTLPTSA SVPRSVPVPP SFLPSLVPTA PPIHDEVGVI PHHQLPVPSQ PTPAGLQQTA SSKSGSGFYF IDEQVETAIA ALHSNLTVFP QLTTSSIPTL TGAHSAGAVG FDIHQVRRDF PILQERVNGR PLVWFDNAAT TQKPQVVIDR LSHYYQHENS NIHRAAHELA ARSTDAYEAA REQVRHFLNA ASTEEVVFVR GTTEAINLVA KSWGSQNLKE GDEIVITWLE HHANIVPWQQ LSAETGARLR VVPVDDYGQV RLDEYQKLLS DRTKIVSFTQ VSNALGTITP AKEIIELAHR YGAKVLLDGA QSVSHLAVDV QALDCDWFVF SGHKVFGPTG IGVLYGKQEL LDATLPWQSG GNMIADVTFE KTVYQPAPAR FEAGTGNIAD AVGLGAALEY VQKIGLEAIA AYEHELLVHG TALLSQIPGL RLIGTAPHKA AVLSFVLEGF SPEAIGQALN REGIAVRAGH HCAQPILRRF GLETTVRPSL AFYNTFEELE TLAAAIRRIQ TGSLAL
|
| |