Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_2183 |
Symbol | |
ID | 5605752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 2384346 |
End bp | 2385566 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640937719 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_001478412 |
Protein GI | 157370423 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000179204 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000283455 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTTTTC CGATTGAACG GGTACGCAGT GAATTTCCGC TGTTGGCACG AGAAGTCAAT GGCCAGCCGC TGGCTTACCT CGACAGTGCC GCCAGCGCGC AGAAACCGCA GGCAGTCATC GATCGCGAGC TGGATTTTTA CCGTCATGGC TACGCGGCGG TACACCGCGG TATTCATACG TTAAGCGCCG AAGCGACGCA GGAAATGGAA GCGGTGCGTG AAAAGGTGGC GGCGTTTATC AATGCCGGTT CGGCGGAAGA GATAATCTTC GTCAAGGGCA CTACCGAAGG CATCAATCTG GTAGCCAACA GTTTTGGCCG CCACTTTTTG CAGCCCGGTG ACAGCATTAT CATCACCGAG ATGGAGCACC ACGCCAACAT CGTACCCTGG CAGATGCTGG CACAGGAACG CGGATTAAAT CTGCGTGTCT GGCCGCTACA GCCAGATGGC ACCCTGGATT TGGCCCGGTT GCCGGGCTTG ATCGACGCGT CAACCAAACT ATTGGCGCTA ACCCAGGTGT CCAACGTGTT GGGCACGGTG AACCCGGTGC AGGAAATTAC GGCCCAGGCA AAAGCGGCGG GCCTGAAGGT ACTGATTGAC GGTGCGCAGG CAGTGATGCA CCAGCGGGTG GATGTTCAGG CGCTGGATTG CGATTTCTAC GTGTTCTCCG GGCATAAGCT GTATGGGCCT TCCGGCATCG GTATTCTTTA CGGTCGTCAG GCACTGTTGC AGCAAATGCC ACCCTGGGAA GGGGGCGGGT CGATGATCCA ACAGGTCAGC CTGACGGCAG GAACCACCTA CGCCGAGCCA CCGTGGCGCT TTGAGGCCGG TTCACCCAAC ACCGCCGGCA TGATGGGATT AGGTGCGGCA ATTGACTATG TGAATACTCT GGGGCTGGAG GCGATTGGTG ACTACGAGCA GTCGCTGATG CATTACGCGC TGGAGGCTCT GCAACAGGTA CCCCGGCTTA AAATCTACGG CCCGGCGGAG CGTGCCGGAG TGATCGCCTT TAACCTGGGT GAGCACCACG CCTATGACGT CGGCAGCTTC CTGGATCAGT ACGGCATTGC CATTCGTACC GGCCACCATT GCGCGATGCC GCTGATGGCG TTCTATAATG TGCCGAGCAT GTGCCGCGCT TCGTTGGCGT TATATAATAC GCGCGATGAA GTAGACCGGC TGGTGGCCGG ATTGCAGCGT ATCCAGAAAC TGCTGGGATA G
|
Protein sequence | MSFPIERVRS EFPLLAREVN GQPLAYLDSA ASAQKPQAVI DRELDFYRHG YAAVHRGIHT LSAEATQEME AVREKVAAFI NAGSAEEIIF VKGTTEGINL VANSFGRHFL QPGDSIIITE MEHHANIVPW QMLAQERGLN LRVWPLQPDG TLDLARLPGL IDASTKLLAL TQVSNVLGTV NPVQEITAQA KAAGLKVLID GAQAVMHQRV DVQALDCDFY VFSGHKLYGP SGIGILYGRQ ALLQQMPPWE GGGSMIQQVS LTAGTTYAEP PWRFEAGSPN TAGMMGLGAA IDYVNTLGLE AIGDYEQSLM HYALEALQQV PRLKIYGPAE RAGVIAFNLG EHHAYDVGSF LDQYGIAIRT GHHCAMPLMA FYNVPSMCRA SLALYNTRDE VDRLVAGLQR IQKLLG
|
| |