Gene Spro_3454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_3454 
Symbol 
ID5604733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp3825427 
End bp3826515 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content60% 
IMG OID640939007 
Productsulfate/thiosulfate transporter subunit 
Protein accessionYP_001479680 
Protein GI157371691 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1118] ABC-type sulfate/molybdate transport systems, ATPase component 
TIGRFAM ID[TIGR00968] sulfate ABC transporter, ATP-binding protein 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.1858 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATTG AGATTAACGG TATCAATAAG TTTTTCGGTC GTACCAAGGT ATTGAACGAT 
ATCTCGCTCG ACATTGCCTC CGGTGAGATG GTGGCACTGC TGGGGCCGTC CGGCTCCGGT
AAAACCACGC TGCTGCGTAT TATCGCCGGG CTGGAAAGCC AGAGCGGCGG CAAGCTGGGC
TTCCACGGCA CCGACGTCAG CCACGTGCAT GCCCGCGATC GTCGAGTGGG CTTCGTGTTC
CAGCATTACG CGCTGTTCCG CCACATGACG GTGTTCGACA ACATCGCCTT TGGCCTGAGC
GTGCTGCCGC GCCGTGAGCG CCCGAATGCC GCGGCGATCA AACAAAAAGT GACCCAGTTG
CTGGAAATGG TGCAGTTGGC CCATTTGGCT AACCGTTATC CGTCACAGCT TTCCGGTGGT
CAGAAGCAGC GCGTGGCCCT GGCGCGTGCA CTGGCGGTCG AACCGCAAAT TCTGCTGCTG
GATGAACCCT TCGGCGCGCT GGATGCGCAG GTGCGTAAAG AACTGCGTCG TTGGCTGCGT
CAACTGCATG AAGAACTGAA ATTTACCAGC GTGTTTGTCA CCCACGATCA GGAAGAGGCA
ATGGAAGTCG CCGATCGTAT CGTGGTGATG AGCCAGGGCA ATATTGAGCA GGTCGGTTCG
CCGGAAGAGA TTGTACGTGA ACCGGCCAGC CGCTTCGTGC TGGAATTTAT GGGCGAAGTG
AACCGCCTGA GCGGCGAGAT CCGCGGTTCG CAGCTGTTCG TCGGTGCGCA CCAGTGGCCT
CTGTCGTTCC AGCCAATGCA CCAGGGCCGC GTGGACTTGT TCCTGCGCCC GTGGGAAATG
GAAGTCGGTA CCGAGAGCAG CGACCGCTGC CCGCTGCCGG TGCAGGTGCT GGAAGTCAGC
CCTCGCGGCC ATTTCTGGCA GATGACCGTG CAGCCGATTG GCTGGCATCA GGAACCTATC
AGCGTGGTGC TGCCGGAGGG TAACGAACCG CCGGTACGCG GTGGCCGCTA CTACGTTGGC
AGCCTGAATG CGCGCCTGTA CGCCGGTGAC CAACTGCTAC AACCTGTTGC GTTAGCTAAA
AGCGCCTGA
 
Protein sequence
MSIEINGINK FFGRTKVLND ISLDIASGEM VALLGPSGSG KTTLLRIIAG LESQSGGKLG 
FHGTDVSHVH ARDRRVGFVF QHYALFRHMT VFDNIAFGLS VLPRRERPNA AAIKQKVTQL
LEMVQLAHLA NRYPSQLSGG QKQRVALARA LAVEPQILLL DEPFGALDAQ VRKELRRWLR
QLHEELKFTS VFVTHDQEEA MEVADRIVVM SQGNIEQVGS PEEIVREPAS RFVLEFMGEV
NRLSGEIRGS QLFVGAHQWP LSFQPMHQGR VDLFLRPWEM EVGTESSDRC PLPVQVLEVS
PRGHFWQMTV QPIGWHQEPI SVVLPEGNEP PVRGGRYYVG SLNARLYAGD QLLQPVALAK
SA