Gene SeD_A0111 details

Gene Information

Locus tag: SeD_A0111
Symbol: thiP
ID: 6875403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 119398
End bp: 121008
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 60%
IMG OID: 642783364
Product: thiamine transporter membrane protein
Protein accession: YP_002214058
Protein GI: 198245222
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1178] ABC-type Fe3+ transport system, permease component
TIGRFAM ID: [TIGR01253] thiamine ABC transporter, permease protein
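
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the record: the function name `cds_lengths` is hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is counted in the gene length but not in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and one terminal stop codon (hypothetical helper).
    """
    gene_length = end_bp - start_bp + 1       # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3                 # includes the stop codon
    protein_length = codons - 1               # stop codon encodes no residue
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_length, protein_length = cds_lengths(119398, 121008)
print(gene_length, protein_length)
```

With the values listed in this record, the result agrees with the stated Gene Length (1611 bp) and Protein Length (536 aa). The calculation does not depend on the strand, which the record leaves blank.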


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGCTGA TTCCGGGGCT GTGCGCCGCC 
GCGCTGATGA TAACCGTCTC GCTGGCGGCC TTTCTGGCGC TTTGGCTGAA CGCGCCGTCG
GGCGCGTGGT CGACAATCTG GCGGGATAGC TACCTGTGGC ATGTGGTGCG CTTCTCATTC
TGGCAGGCGT TTCTGTCCGC AGTGCTGTCT GTGGTCCCGG CGGTTTTTCT CGCCCGCGCG
CTTTATCGAC GCCGTTTTCC TGGACGTCTG GCGCTGCTGC GTCTGTGCGC CATGACGCTG
ATTCTGCCGG TACTGGTTGC CGTGTTCGGT ATTCTTAGCG TGTATGGCCG TCAGGGCTGG
CTGGCCTCGC TGTGGCAGAT GCTGGGGCTT CAGTGGACAT TCTCCCCCTA CGGCTTGCAG
GGCATTTTAC TGGCGCACGT CTTTTTTAAC CTGCCAATGG CGAGCCGTTT GTTGCTGCAA
TCTTTGGAAA GCATTCCCGG CGAGCAGCGC CAGCTCGCCG CCCAGCTCGG TATGCGCGGC
TGGCATTTTT TCCGTTTTGT CGAATGGCCG TGGCTGCGCC GCCAAATTCC ACCCGTCGCG
GCGCTGATTT TTATGCTCTG CTTCGCCAGT TTCGCAACGG TTCTGTCGCT CGGCGGCGGC
CCGCAGGCCA CCACTATCGA ACTGGCTATC TTTCAGGCGC TTAGTTATGA CTACGATCCC
GCCCGCGCGG CGATGCTGGC ATTAATCCAG ATGGTCTGCT GCCTGGCGCT GGTACTGCTA
AGCCAACGGC TGAGCAAAGC GATTGCGCCG GGGATGACGC TGACGCAGGG TTGGCGCGAT
CCTGACGATC GTCTCCACAG CCGTCTGACG GACGCCTTAT TAATCGTGCT GGCGCTGCTG
CTGCTGCTTC CGCCGCTGGT CGCCGTGGTG GTCGATGGCG TCAACCGCAG CCTGCCGGAG
GTGCTGGCGC AACCCATTCT GTGGCAGGCT GTCTGGACAT CGCTACGCAT TGCGCTGGCG
GCGGGCGTTC TGTGTGTGGT GCTGACCATG ATGCTACTGT GGAGTAGCCG CGAGCTACGC
CAACGCCAGC AAGTCTTCGC CGGACAAACG CTGGAACTCA GCGGGATGTT GATCCTCGCG
ATGCCGGGGA TCGTGCTGGC GACAGGCTTC TTTTTACTGC TCAATAGCAG CGTTGGCTTA
CCGGAATCCG CCGACGGCAT CGTGATTTTC ACCAATGCGC TGATGGCAAT CCCCTACGCG
CTAAAAGTCC TGGAAAACCC AATGCGCGAT ATTACCGCCC GTTATGGAAT GCTGTGCCAG
TCATTAGGAA TTGAAGGCTG GTCGCGGTTA AAGGTCGTGG AACTGCGGGC GCTGAAACGT
CCGCTGGCGC AGGCGCTGGC GTTTGCCTGC GTACTTTCTA TCGGTGATTT TGGCGTCGTC
GCGCTTTTCG GCAATGACAA TTTCCGCACG CTGCCGTTTT ATCTGTATCA GCAGATCGGC
TCCTACCGCA GCCAGGACGG CGCGGTGACC GCGCTGATAC TGCTGCTGCT CTGCTTTACG
TTATTTACCC TCATAGAGAA ACTACCGGGC CGACATGCTA AAACTGATTG A
 
Protein sequence
MATRRQPLIP GWLIPGLCAA ALMITVSLAA FLALWLNAPS GAWSTIWRDS YLWHVVRFSF 
WQAFLSAVLS VVPAVFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW
LASLWQMLGL QWTFSPYGLQ GILLAHVFFN LPMASRLLLQ SLESIPGEQR QLAAQLGMRG
WHFFRFVEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI FQALSYDYDP
ARAAMLALIQ MVCCLALVLL SQRLSKAIAP GMTLTQGWRD PDDRLHSRLT DALLIVLALL
LLLPPLVAVV VDGVNRSLPE VLAQPILWQA VWTSLRIALA AGVLCVVLTM MLLWSSRELR
QRQQVFAGQT LELSGMLILA MPGIVLATGF FLLLNSSVGL PESADGIVIF TNALMAIPYA
LKVLENPMRD ITARYGMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV
ALFGNDNFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFT LFTLIEKLPG RHAKTD