Gene Spro_2183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_2183 
Symbol 
ID5605752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2384346 
End bp2385566 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content58% 
IMG OID640937719 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001478412 
Protein GI157370423 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000179204 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000283455 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTTTTC CGATTGAACG GGTACGCAGT GAATTTCCGC TGTTGGCACG AGAAGTCAAT 
GGCCAGCCGC TGGCTTACCT CGACAGTGCC GCCAGCGCGC AGAAACCGCA GGCAGTCATC
GATCGCGAGC TGGATTTTTA CCGTCATGGC TACGCGGCGG TACACCGCGG TATTCATACG
TTAAGCGCCG AAGCGACGCA GGAAATGGAA GCGGTGCGTG AAAAGGTGGC GGCGTTTATC
AATGCCGGTT CGGCGGAAGA GATAATCTTC GTCAAGGGCA CTACCGAAGG CATCAATCTG
GTAGCCAACA GTTTTGGCCG CCACTTTTTG CAGCCCGGTG ACAGCATTAT CATCACCGAG
ATGGAGCACC ACGCCAACAT CGTACCCTGG CAGATGCTGG CACAGGAACG CGGATTAAAT
CTGCGTGTCT GGCCGCTACA GCCAGATGGC ACCCTGGATT TGGCCCGGTT GCCGGGCTTG
ATCGACGCGT CAACCAAACT ATTGGCGCTA ACCCAGGTGT CCAACGTGTT GGGCACGGTG
AACCCGGTGC AGGAAATTAC GGCCCAGGCA AAAGCGGCGG GCCTGAAGGT ACTGATTGAC
GGTGCGCAGG CAGTGATGCA CCAGCGGGTG GATGTTCAGG CGCTGGATTG CGATTTCTAC
GTGTTCTCCG GGCATAAGCT GTATGGGCCT TCCGGCATCG GTATTCTTTA CGGTCGTCAG
GCACTGTTGC AGCAAATGCC ACCCTGGGAA GGGGGCGGGT CGATGATCCA ACAGGTCAGC
CTGACGGCAG GAACCACCTA CGCCGAGCCA CCGTGGCGCT TTGAGGCCGG TTCACCCAAC
ACCGCCGGCA TGATGGGATT AGGTGCGGCA ATTGACTATG TGAATACTCT GGGGCTGGAG
GCGATTGGTG ACTACGAGCA GTCGCTGATG CATTACGCGC TGGAGGCTCT GCAACAGGTA
CCCCGGCTTA AAATCTACGG CCCGGCGGAG CGTGCCGGAG TGATCGCCTT TAACCTGGGT
GAGCACCACG CCTATGACGT CGGCAGCTTC CTGGATCAGT ACGGCATTGC CATTCGTACC
GGCCACCATT GCGCGATGCC GCTGATGGCG TTCTATAATG TGCCGAGCAT GTGCCGCGCT
TCGTTGGCGT TATATAATAC GCGCGATGAA GTAGACCGGC TGGTGGCCGG ATTGCAGCGT
ATCCAGAAAC TGCTGGGATA G
 
Protein sequence
MSFPIERVRS EFPLLAREVN GQPLAYLDSA ASAQKPQAVI DRELDFYRHG YAAVHRGIHT 
LSAEATQEME AVREKVAAFI NAGSAEEIIF VKGTTEGINL VANSFGRHFL QPGDSIIITE
MEHHANIVPW QMLAQERGLN LRVWPLQPDG TLDLARLPGL IDASTKLLAL TQVSNVLGTV
NPVQEITAQA KAAGLKVLID GAQAVMHQRV DVQALDCDFY VFSGHKLYGP SGIGILYGRQ
ALLQQMPPWE GGGSMIQQVS LTAGTTYAEP PWRFEAGSPN TAGMMGLGAA IDYVNTLGLE
AIGDYEQSLM HYALEALQQV PRLKIYGPAE RAGVIAFNLG EHHAYDVGSF LDQYGIAIRT
GHHCAMPLMA FYNVPSMCRA SLALYNTRDE VDRLVAGLQR IQKLLG