Gene SeD_A4461 details


Gene Information

Locus tag: SeD_A4461
Symbol:
ID: 6873413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4300448
End bp: 4301437
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 52%
IMG OID: 642787379
Product: sulfate transporter subunit
Protein accession: YP_002217990
Protein GI: 198245929
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
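
The length fields above follow from the coordinates by simple arithmetic. The sketch below is not part of the database record; it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported 990 bp) and that the CDS ends in a stop codon. The function names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4300448
end_bp = 4301437


def gene_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length):
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 990
print(protein_length(length_bp))  # 329
```

Both results agree with the Gene Length and Protein Length fields.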


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 81
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGT GGGGCGTGGG GTTCACATTA TTGCTGGCGT CAACCAGCAT TCTGGCAAGG 
GATATTCAGT TACTTAACGT ATCATACGAT CCAACGCGTG AGCTGTACGA GCAGTACAAC
AAAGCGTTTA GCGCGCACTG GAAGCAGGAA ACCGGCGACA ACGTCGTGAT CCGTCAATCG
CACGGCGGTT CGGGTAAACA GGCGACTTCC GTTATTAACG GCATTGAAGC CGATGTCGTG
ACGCTGGCGC TGGCTTACGA TGTGGACGCT ATTGCGGAAC GTGGTCGTAT CGATAAAAAC
TGGATCAAGC GTCTGCCGGA TAACTCAGCA CCTTACACCT CCACCATCGT TTTCCTGGTC
CGCAAAGGCA ATCCAAAACA AATTCATGAC TGGAATGATC TGATTAAACC CGGTGTGTCG
GTGATTACGC CAAACCCGAA AAGCTCCGGC GGCGCACGCT GGAACTATCT GGCGGCATGG
GGCTACGCCC TGCACCACAA CAATAACGAT CAGGCCAAAG CGCAGGACTT TGTCAAAGCC
CTGTTTAAAA ACGTTGAAGT GCTGGACTCC GGCGCGCGCG GCTCAACCAA TACTTTCGTT
GAACGCGGTA TCGGTGATGT GCTGATTGCC TGGGAAAACG AAGCGCTACT GGCGACGAAT
GAACTGGGTA AAGATAAATT CGAGATAGTG ACCCCGAGTG AATCTATTCT CGCAGAGCCG
ACCGTGTCCG TGGTGGATAA AGTCGTTGAG AAGAAAGGGA CGAATGCGGT GGCGGAAGCG
TATCTGAAGT ATCTCTACTC GCCGGAAGGG CAGGAGATAG CAGCGAAAAA CTTCTACCGT
CCGCGCGATG CTGACGTGGC GAAAAAATAC GACGATGCGT TTCCGAAGCT GAAGCTGTTC
ACCATTGACG AGGTGTTTGG CGGCTGGGCG AAGGCGCAAA AAGATCACTT CGCTAATGGC
GGTACGTTCG ACCAAATTAG CAAACGCTAA
 
Protein sequence
MKKWGVGFTL LLASTSILAR DIQLLNVSYD PTRELYEQYN KAFSAHWKQE TGDNVVIRQS 
HGGSGKQATS VINGIEADVV TLALAYDVDA IAERGRIDKN WIKRLPDNSA PYTSTIVFLV
RKGNPKQIHD WNDLIKPGVS VITPNPKSSG GARWNYLAAW GYALHHNNND QAKAQDFVKA
LFKNVEVLDS GARGSTNTFV ERGIGDVLIA WENEALLATN ELGKDKFEIV TPSESILAEP
TVSVVDKVVE KKGTNAVAEA YLKYLYSPEG QEIAAKNFYR PRDADVAKKY DDAFPKLKLF
TIDEVFGGWA KAQKDHFANG GTFDQISKR
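
The protein sequence can be reproduced from the gene sequence with translation table 11. The following is a minimal sketch rather than the tool the database uses: it assumes the sequence is given in the coding orientation, as printed above, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from this record.
print(translate("ATGAAAAAGT GGGGCGTGGG GTTCACATTA"))  # MKKWGVGFTL
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_percent` on the same input can be compared against the GC content field.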