Gene Spro_3807 details

Gene Information

Locus tag: Spro_3807
Symbol:
ID: 5606918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 4206834
End bp: 4208039
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 61%
IMG OID: 640939364
Product: cysteine sulfinate desulfinase
Protein accession: YP_001480031
Protein GI: 157372042
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily; [TIGR03392] cysteine desulfurase, catalytic subunit CsdA
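
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4206834, 4208039)
print(length, "bp")                      # compare with "Gene Length"
print(protein_length_aa(length), "aa")   # compare with "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.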


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000833089
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000586556
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACACCTT TTAATCCCAT CGTTTTTCGT AATCAGTTTC CTGCCTTGCA GCAAGCGGGC 
ATTTATCTCG ACAGCGCCGC CACCGCGTTG AAGCCGCTGG CGGTCATCAA TGCCACGCAG
CAGTTTTACC GCGACGATGC CGCTACCGTG CACCGCAGCC AGCACCGGGC GGCGCAGGAT
CTGACTGCAC GCTTCGAGCA GGCTCGCCAA CAGGTCGCGA CGCTGGTCAA TGCTCCTTCA
GCCGACGACA TTATCTGGAC CCGCGGCACC ACCGAAGCGA TCAATTTGGT AGCGCAGAGC
TATGCCCGCC CCCGGTTGCA GCCCGGTGAC GAAATCCTGG TGAGCGAGGC CGAACACCAC
GCCAACCTGA TCCCCTGGCT GATGGTGGCG GAACAAACCG GCGCTCGGGT GGTAAAACTG
CCACTGGGTG CAGACCGCCT GCCGGATCTG GCCCAGTTAC CCAGCCTGCT GAGTGATAAA
ACCCGCCTGC TGGCACTGGG CCAAATGTCC AACGTCACCG GCGGCTGTCC GGATCTGGAT
CTGGCCATCC GGCTTGCCCA TAGTGCCGGC GCTCTGGTGA TGATCGACGG CGCGCAGGGC
ATCGTTCACT GTCCGGCTGA CGTTCAGCGA CTAGACATTG ATTTCTACGC TTTCTCCGGC
CACAAATTAT ATGGCCCGAC CGGCATCGGT GCGCTATACG GTAAAAGCGA ATTGCTGGCG
CAGATGGCCC CCTGGCAAGG CGGCGGCAAG ATGCTGACTC AGGCCTCCTT CGACGGCTTC
ACGCCGCAAA AACCACCGCA CTGTTTTGAA GCCGGTACGC CGAATATCGC CGGTGTGCTG
GGGTTGGCCG CCGCATTGGA ATGGCTTGGC ACCCAGGATC TGGCGGCAGC CGAGCAATAC
AGCCGCGAAC TGGCCGATCT CGCCGAGAAA CAATTGGCGC AACTGCCGGG GTTCCGCAGC
TTCCGTTGTT CGGGCTCCAG CTTACTGGCG TTTGATATTG CCGGTATCCA TCACAGCGAT
ATCGTCACCC TGCTGGCAGA ACAAGGCATC GCACTGCGAG CCGGTCAGCA CTGCGCTCAA
CCGCTGATGG CGGCGCTGGG TGTCAGTGGG ACACTACGCG CCTCCTTTGC GCCATACAAC
ACGCGGGAAG ACGTCGATAC CCTGGTAACC GCCCTGCACA ACGCCATCGA CCTGTTGGCC
GATTAA
 
Protein sequence
MTPFNPIVFR NQFPALQQAG IYLDSAATAL KPLAVINATQ QFYRDDAATV HRSQHRAAQD 
LTARFEQARQ QVATLVNAPS ADDIIWTRGT TEAINLVAQS YARPRLQPGD EILVSEAEHH
ANLIPWLMVA EQTGARVVKL PLGADRLPDL AQLPSLLSDK TRLLALGQMS NVTGGCPDLD
LAIRLAHSAG ALVMIDGAQG IVHCPADVQR LDIDFYAFSG HKLYGPTGIG ALYGKSELLA
QMAPWQGGGK MLTQASFDGF TPQKPPHCFE AGTPNIAGVL GLAAALEWLG TQDLAAAEQY
SRELADLAEK QLAQLPGFRS FRCSGSSLLA FDIAGIHHSD IVTLLAEQGI ALRAGQHCAQ
PLMAALGVSG TLRASFAPYN TREDVDTLVT ALHNAIDLLA D
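
The two sequence blocks can be compared by translating the gene sequence. The following is a minimal sketch, not the database's own method: it strips the 10-character grouping, builds a codon table in the usual NCBI base order (TCAG), and translates until the first stop codon. The amino-acid assignments used here are those of the standard code, which translation table 11 shares for non-start codons. The names `clean`, `translate`, and `CODON_TABLE` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGACACCTT TTAATCCCAT CGTTTTTCGT"))
```

Passing the full gene sequence block to `translate` and the protein block to `clean` gives two strings that can be compared directly.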