Gene SeHA_C0113 details

Gene Information

Locus tag: SeHA_C0113
Symbol: thiP
ID: 6490442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 119300
End bp: 120910
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 60%
IMG OID: 642740401
Product: thiamine transporter membrane protein
Protein accession: YP_002044075
Protein GI: 194449238
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1178] ABC-type Fe3+ transport system, permease component
TIGRFAM ID: [TIGR01253] thiamine ABC transporter, permease protein
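
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # total codons minus the stop codon


length = gene_length_bp(119300, 120910)
print(length, protein_length_aa(length))  # prints: 1611 536
```

Both results agree with the Gene Length and Protein Length values listed in this record.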


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value: 0.195449
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGCTGA TTCCGGGGCT GTGCGCCGCC 
GCGCTGATGA TAACCGTCTC GCTGGCGGCC TTTCTGGCGC TTTGGCTGAA TGCGCCGTCG
GGCGCGTGGT CGACAATCTG GCGGGATAGC TACCTGTGGC ATGTGGTGCG CTTCTCATTC
TGGCAGGCGT TTCTGTCCGC AGTGCTGTCT GTGGTCCCGG CGGTTTTTCT CGCCCGCGCG
CTTTATCGAC GCCGTTTTCC TGGACGTCTG GCGCTGCTGC GTCTGTGCGC CATGACGCTG
ATTCTGCCGG TACTGGTTGC CGTGTTCGGC ATTCTTAGCG TGTATGGCCG TCAGGGCTGG
CTGGCCTCGC TGTGGCAGAT GCTGGGGCTT CAGTGGACAT TCTCCCCCTA CGGCTTGCAG
GGCATTTTAC TGGCGCACGT CTTTTTTAAC CTGCCAATGG CGAGCCGTTT GTTGCTGCAA
TCTTTGGAAA GCATTCCCGG CGAGCAGCGC CAGCTCGCCG CCCAGCTCGG TATGCGCGGC
TGGCATTTTT TCCGTTTTGT CGAATGGCCG TGGCTGCGCC GCCAAATTCC GCCTGTCGCG
GCGCTGATTT TTATGCTCTG CTTCGCCAGT TTCGCAACGG TTCTGTCGCT CGGCGGCGGC
CCGCAGGCCA CCACTATCGA ACTGGCTATC TTTCAGGCGC TCAGTTATGA CTACGATCCC
GCCCGCGCGG CGATGCTGGC ATTAATCCAG ATGGTCTGCT GCCTGGCGCT GGTACTGCTA
AGCCAACGGT TGAGCAAAGC GATTGCGCCG GGGATGACGC TGACGCAGGG CTGGCGCGAT
CCTGACGATC GTCTCCACAG CCGTCTGACG GACGCCTTAT TAATCGTGCT GGCGCTGCTG
CTGCTGCTTC CGCCGCTGGT CGCCGTGGTG GTCGATGGCG TCAACCGCAG CCTGCCGGAG
GTGCTGGCGC AACCCATTCT GTGGCAGGCT GTCTGGACAT CGCTACGCAT TGCGCTGGCG
GCGGGCGTTC TGTGTGTGGT GCTGACCATG ATGCTGCTGT GGAGTAGCCG CGAGCTACGC
CAACGCCAGC AACTATTCGC CGGACAAACG CTGGAACTCA GCGGAATGTT GATCCTCGCG
ATGCCGGGGA TCGTGCTGGC GACAGGCTTC TTTTTACTGC TCAATAACAG CGTTGGCTTA
CCGGAATCCG CCGACGGCAT CGTGATTTTC ACCAATGCGC TGATGGCAAT CCCCTACGCG
CTAAAAGTCC TGGAAAACCC AATGCGCGAT ATTACCGCCC GTTATGGAAT GCTGTGCCAG
TCATTAGGAA TTGAAGGCTG GTCGCGGTTA AAGATCGTGG AACTGCGGGC GCTGAAACGT
CCGCTGGCGC AGGCGCTGGC GTTTGCCTGC GTACTTTCTA TCGGTGATTT TGGCGTCGTC
GCGCTTTTCG GCAATGACAA TTTCCGCACG CTGCCGTTTT ATCTGTATCA GCAGATCGGC
TCCTACCGCA GCCAGGACGG CGCAGTGACC GCGCTGATAC TGCTGCTGCT CTGCTTTACG
TTATTTACCC TCATAGAGAA ACTACCGGGC CGACATGCTA AAACTGATTG A
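
A GC content figure such as the one reported for this gene can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch: `gc_fraction` is a hypothetical helper, and skipping every character other than A, C, G and T (spaces, line breaks, ambiguity codes) is an assumption about how such input should be handled, not a documented rule of the source database.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq."""
    bases = [c for c in seq.upper() if c in "ACGT"]  # drops spaces, newlines, N
    if not bases:
        raise ValueError("no A/C/G/T characters found")
    return sum(c in "GC" for c in bases) / len(bases)


# First line of the gene sequence shown above, kept in its 10-base blocks.
first_line = "ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGCTGA TTCCGGGGCT GTGCGCCGCC"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence instead of one line gives the value for the whole gene.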
 
Protein sequence
MATRRQPLIP GWLIPGLCAA ALMITVSLAA FLALWLNAPS GAWSTIWRDS YLWHVVRFSF 
WQAFLSAVLS VVPAVFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW
LASLWQMLGL QWTFSPYGLQ GILLAHVFFN LPMASRLLLQ SLESIPGEQR QLAAQLGMRG
WHFFRFVEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI FQALSYDYDP
ARAAMLALIQ MVCCLALVLL SQRLSKAIAP GMTLTQGWRD PDDRLHSRLT DALLIVLALL
LLLPPLVAVV VDGVNRSLPE VLAQPILWQA VWTSLRIALA AGVLCVVLTM MLLWSSRELR
QRQQLFAGQT LELSGMLILA MPGIVLATGF FLLLNNSVGL PESADGIVIF TNALMAIPYA
LKVLENPMRD ITARYGMLCQ SLGIEGWSRL KIVELRALKR PLAQALAFAC VLSIGDFGVV
ALFGNDNFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFT LFTLIEKLPG RHAKTD
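
The protein sequence above can be derived from the gene sequence by codon translation. The following is a minimal sketch, not the pipeline that produced this record. It builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts; this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # remove block spacing and line breaks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGCAACGC GCCGTCAGCC GTTAATTCCC"))  # prints: MATRRQPLIP
```

The output matches the first ten residues of the protein sequence listed above.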