Gene Spro_4806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4806 
Symbol 
ID5605477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp5325003 
End bp5325998 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content55% 
IMG OID640940379 
Productsulfate transporter subunit 
Protein accessionYP_001481027 
Protein GI157373038 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000221787 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTATGC GTAAATGGGG TGCAGGTCTG ACAATAATGC TGCTGGCGTC CGGCGCCATG 
GCGAAAGATA TCCAATTGCT GAACGTTTCA TACGACCCGA CGCGTGAGTT CTATCAGGAA
TACAACACCG CATTCGGTAA ATACTGGCAG CAGCAGACCG GCGATAAAGT TACGGTGCGC
CAGTCGCATG GCGGCTCCGG CAAGCAGGCG ACTTCGGTGA TTAACGGCAT TGAGGCCGAC
GTGGTGACAC TGGCACTGGC CTATGACGTG GACGCTATCG CTGAGCGCGG GCGCATTGAT
AAAGAGTGGA TCAAACGTCT GCCGGACAAC TCGGCACCTT ATACCTCGAC CATTGTGTTC
CTGGTGCGCA AAGGTAATCC AAAGCAAATT CACGATTGGG CGGATTTGAT CAAACCGGGC
GTCTCGGTAA TCACCCCGAA CCCGAAAACT TCCGGTGGCG CACGCTGGAA CTATCTGGCA
GCCTGGGGTT ATGCACTGCA TCAGAACAAT AACGATCAGG CCAAGGCGCA AGAATTCGTT
AAAAACCTGT ATAAGAACGT CGAAGTGCTG GATTCCGGTG CGCGCGGTTC AACTAATACC
TTCGTTGAAC GCGGTATCGG TGATGTGCTG ATCGCCTGGG AGAACGAAGC GCTGCTGGCG
GAAAAAGAGC TGGGCAAGGA CAAGTTTGAG ATTATCACCC CAAGCGAATC GATTCTGGCC
GAGCCGACCG TGTCGGTGGT GGATAAAGTG GTTGATAAGC GCGGTACCCG TGATGTGGCT
ACGGCTTACC TGAAGTATCT GTATACGCCG GAAGGGCAGA CCATCGCGGC GAAAAACTAT
TACCGTCCAC GCGATGCGGC GGTAGCGGCC AAGTTTGCCG ACCAGTTCCC GAAACTGAAA
CTGTTTACCG TGGATGATAC TTTCGGCGGC TGGACCCAGG CGCAGAAGGT GCACTTTGCC
ACCGGCGGCG TGTTTGACGA AATCAGCAAA CGTTGA
 
Protein sequence
MRMRKWGAGL TIMLLASGAM AKDIQLLNVS YDPTREFYQE YNTAFGKYWQ QQTGDKVTVR 
QSHGGSGKQA TSVINGIEAD VVTLALAYDV DAIAERGRID KEWIKRLPDN SAPYTSTIVF
LVRKGNPKQI HDWADLIKPG VSVITPNPKT SGGARWNYLA AWGYALHQNN NDQAKAQEFV
KNLYKNVEVL DSGARGSTNT FVERGIGDVL IAWENEALLA EKELGKDKFE IITPSESILA
EPTVSVVDKV VDKRGTRDVA TAYLKYLYTP EGQTIAAKNY YRPRDAAVAA KFADQFPKLK
LFTVDDTFGG WTQAQKVHFA TGGVFDEISK R