Gene SeD_A2807 details


Gene Information

Locus tag: SeD_A2807
Symbol: cysA
ID: 6874419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2678875
End bp: 2679972
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 58%
IMG OID: 642785861
Product: sulfate/thiosulfate transporter subunit
Protein accession: YP_002216511
Protein GI: 198245645
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1118] ABC-type sulfate/molybdate transport systems, ATPase component
TIGRFAM ID: [TIGR00968] sulfate ABC transporter, ATP-binding protein
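
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about how the record was generated, not something the record states; the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2678875
end_bp = 2679972

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1098
print(protein_length_aa)  # 365
```

Under these assumptions the computed values agree with the listed Gene Length (1098 bp) and Protein Length (365 aa).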


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 88
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATTG AGATCGCCAG AATTAAGAAA TCCTTTGGTC GCACTCAGGT ACTGAATGAT 
ATCTCGCTGG ATATTCCTTC CGGCCAGATG GTTGCCTTGT TGGGGCCGTC AGGTTCCGGC
AAAACGACGC TGCTGCGCAT TATTGCCGGG CTGGAGCATC AGTCCAGCGG TCATATTCGT
TTCCACGGTA CGGACGTTAG CCGCCTGCAC GCGCGTGAGC GTAAAGTCGG TTTTGTGTTT
CAGCACTATG CGCTGTTTCG CCATATGACG GTGTTTGACA ACATCGCTTT TGGCCTGACG
GTGCTGCCGC GACGCGACCG CCCAACTGCG GCGGCGATTA AAACGAAAGT GACGCAATTG
CTGGAGATGG TGCAACTGGC GCACCTCGCG GATCGCTTCC CCGCCCAGCT TTCCGGCGGA
CAAAAACAGC GCGTGGCGCT GGCGCGTGCG CTTGCCGTAG AACCGCAAAT TCTACTGCTG
GATGAACCCT TTGGCGCGCT GGATGCCCAG GTACGTAAAG AGTTACGTCG CTGGTTGCGC
CAGCTACATG AAGAACTGAA ATTCACCAGC GTCTTTGTCA CCCACGATCA GGAAGAAGCG
ACGGAAGTGG CGGATCGGGT GGTGGTGATG AGTCAGGGCA ATATCGAACA GGCTGATGCG
CCGGATCGAG TGTGGCGTGA ACCGGCAACC CGCTTCGTAC TGGAGTTTAT GGGCGAGGTG
AACCGCCTGA CAGGCACCGT CCGCGGCGGG CAGTTTCACG TTGGCGCGCA TCGCTGGCCG
CTGGGTTATA CGCCCGCGTA CCAGGGGCCG GTCGATCTGT TCCTGCGTCC GTGGGAGGTG
GATATTAGCC GCCGCACCAG CCTGGATTCT CCGCTACCAG TACAGGTCAT TGAGGCCAGC
CCGAAAGGTC ACTATACACA GTTAGTGGTA CAGCCGCTGG GGTGGTATCA CGATCCGCTG
ACGGTGGTGA TGGCGGGTGA GGATGTTCCG GTTCGCGGCG AGCGTTTGTT TGTTGGACTG
CAAAAAGCGC GTCTGTATAA CGGCGACCAG CGTATTGAAA CGCGTGAAGA GGAACTTGCT
CTGGCGCAAT CGGCCTGA
 
Protein sequence
MSIEIARIKK SFGRTQVLND ISLDIPSGQM VALLGPSGSG KTTLLRIIAG LEHQSSGHIR 
FHGTDVSRLH ARERKVGFVF QHYALFRHMT VFDNIAFGLT VLPRRDRPTA AAIKTKVTQL
LEMVQLAHLA DRFPAQLSGG QKQRVALARA LAVEPQILLL DEPFGALDAQ VRKELRRWLR
QLHEELKFTS VFVTHDQEEA TEVADRVVVM SQGNIEQADA PDRVWREPAT RFVLEFMGEV
NRLTGTVRGG QFHVGAHRWP LGYTPAYQGP VDLFLRPWEV DISRRTSLDS PLPVQVIEAS
PKGHYTQLVV QPLGWYHDPL TVVMAGEDVP VRGERLFVGL QKARLYNGDQ RIETREEELA
LAQSA
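
The relationship between the gene sequence and the protein sequence above can be checked with a small translation routine. This is a minimal sketch, not the tool used to build the record: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons, which this sketch ignores). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (amino acid assignments
# are the same for the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGAGCATTG AGATCGCCAG AATTAAGAAA"
print(translate(first_blocks))  # MSIEIARIKK

# Last line of the gene sequence, which ends in the stop codon TGA.
last_line = "CTGGCGCAAT CGGCCTGA"
print(translate(last_line))     # LAQSA
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the listed protein sequence and the GC content field.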