Gene HS_1012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1012 
SymbolthiP 
ID4240505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1115127 
End bp1116671 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content35% 
IMG OID638104568 
Productthiamine transporter membrane protein 
Protein accessionYP_719223 
Protein GI113461155 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1178] ABC-type Fe3+ transport system, permease component 
TIGRFAM ID[TIGR01253] thiamine ABC transporter, permease protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.775062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGGTTAT TTATTGTTTC ATTTTATGGT CTGGTATTGA GCGGTATTTT TACTGTTGAA 
ACTGATTATA AATGGACGGA ACTGATACGA GATGATTACA TCAAGCAAGT TATTTTATTT
AGTTTTGGAC AGGCTACCTT ATCTTCCTTA CTTTCCGTTT TTTTGGGCTT ATTATTCGCT
CGAGCCTTTT TCTATCAAAG GTTTATAGCA AAACCTTTGA TTCTTAAGCT ATTTTCGTTA
ACTTTTGTGT TGCCTGCATT AGTTGTAATT TTTGGTATTA CAGGAGTTTA TGGACATAAC
GGCTGGCTGG TTAAATTAAC AACCTTTTTA GGCATAACGT GGCAACCCCA TATTTATGGT
TTAAGCGGTA TTTTAATTGC ACATTTATTT TTTAATATTC CTTTAGCAGC AAGAATGTTT
TTGCAAACAT TCCAAACAAT TCCTACTCAG CAAAGGCAAC TGGCATCACA ATTGAATCTT
CGTGGTTGGC AGTTTATTCG TTTGATTGAA TTTCCCTATT TACGCCAACA ATTATTACCG
GTTTTTGGCT TGATTTTTAT GCTGTGTTTC ACCAGTTTTT CTATTGTTCT CACCTTAGGC
GGCGGTCCCA AATACACTAC ATTGGAAGTG GCTATTTACC AAGCCGTTTT ATTTGAATTT
GATTTGGCGA AGTCGGCTTT ATTTGCGTTG TTACAAGTAA TTTTTTGTTG TGTGCTGTTT
GCATTGGGTA GCCTATGGCA AAAATCTCCA CAAGTGTTAT TGCATAGTAA AAATATTTGG
TTAGAAAAGC AGTCAAGAGC GGTACAAATC TGGCAAATTT TTTATATCAA TTGCGTTTTA
TTGTTCATCA GCTTGCCTTT AATAAATATT GTTTTCTCCG CTTTTCAAGC ACAAAGTTTA
TGGCAAATTT GGCAGCAATC ACAACTTTGG CAGGCTTTAG GTTATTCGTT AGCTATTGCA
CCTTTATCAG CGATTTTAGC GATGTTATTT TCCGTTTCAT TATTATTACT GGCACGCCGT
TTACACTATT TGTCTTATTC ATTTTTATCT CAGAGTATTT TAAATAGCGG AATGCTGGTG
CTGGCTATCC CTGTCTTGGT TATTTCAGTA GGTTTATTTA TTCGCTTAAG GGAAATGGAT
TTCTCCCACT ATCATTTATT TGCTTTAGTC GTACTATGTA ATGGCTTAAC CGCCATGCCT
TTTGTTTTAC AGGTTCTAAA GTTACCGATG TACAACAATA TGCAACATTA CGAAAAACTA
TCGCAATCCT TGGCTATTCA AGGTTGGAAT AGATTTTATT TAATTGAATG GCATAATTTA
AAAGCATCAT TTAAATATGC TTTTGCACTG GCTTGTGCTA TTTCCTTAGG GGATTTTACT
GCCATAGCTT TATTCGGGAA TCAGGATTTT AGTTCCTTGC CTTATTTGTT ATATCAACAA
TTAGGTAGTT ATCGCTCTGA CGAAGGTGCA GTAACTGCAT TACTGTTACT CGTTTTTTGT
ACATGTATTT TTATATTAAT TGAACGAATA AAAAAAGATG ATTAA
 
Protein sequence
MWLFIVSFYG LVLSGIFTVE TDYKWTELIR DDYIKQVILF SFGQATLSSL LSVFLGLLFA 
RAFFYQRFIA KPLILKLFSL TFVLPALVVI FGITGVYGHN GWLVKLTTFL GITWQPHIYG
LSGILIAHLF FNIPLAARMF LQTFQTIPTQ QRQLASQLNL RGWQFIRLIE FPYLRQQLLP
VFGLIFMLCF TSFSIVLTLG GGPKYTTLEV AIYQAVLFEF DLAKSALFAL LQVIFCCVLF
ALGSLWQKSP QVLLHSKNIW LEKQSRAVQI WQIFYINCVL LFISLPLINI VFSAFQAQSL
WQIWQQSQLW QALGYSLAIA PLSAILAMLF SVSLLLLARR LHYLSYSFLS QSILNSGMLV
LAIPVLVISV GLFIRLREMD FSHYHLFALV VLCNGLTAMP FVLQVLKLPM YNNMQHYEKL
SQSLAIQGWN RFYLIEWHNL KASFKYAFAL ACAISLGDFT AIALFGNQDF SSLPYLLYQQ
LGSYRSDEGA VTALLLLVFC TCIFILIERI KKDD