Gene HS_0031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0031 
Symbol 
ID4239539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp32278 
End bp33753 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content35% 
IMG OID638103562 
Productsugar ABC transporter, ATP-binding 
Protein accessionYP_718237 
Protein GI113460180 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAATA TCAATAAGAC ATTCCACGGT GTAAAAGCCT TAAATCGAGT TAACCTCTCT 
TTAGATTATG GAGAAGCTCT CTGCCTTGCC GGTCAAAACG GTTGTGGAAA ATCCACGCTA
ATCAAAATCC TCTCAGGTGT TTATCAACCA GATAAAGGGG CTGAAATTCA AATTGGTGCG
AGCAAATATA CCAAACTGAC GCCACAAGCT TCTATTGAAC AGGGAATTCA AGTAATCTAT
CAAGATTTAG CCCTTTTTCC TAATTTAACT GTCGCTGAAA ATATTGCAAT AAATTTACAT
CGAAAATTAG GTTGGGTCAG CCAATCAGAA ATTCATCAAG TTGCATTAAA CGCAATACTA
AGCATTAATG CAGATTTAGA TCTCAATGCT ATTTTAGAAG ATTTACCAAT TGCACAGCAA
CAATTAGTCG CTATTTGTAG AGCGCTTGCA CAAAATGCTC GACTGTTAAT TATGGATGAG
CCAACAGCAT CTCTTACTGC GAAAGAAGTA CAAGATCTGC TAAAAGTTGT ACTCAAGTTA
AAAAGTAAAG GCATTAGTAT TATTTTTGTC AGTCATAAAT TACAAGAAGT AATGAGTGTC
TCTGATACCG TTTTAGTACT TAAAAATGGG AATATGGTTG GACAATACCC TATTAGTGAA
ATGGATGAAA AACGCTTAGG ATTCTTAATG ACAGGCTTGG AAATTGACTA TAAACGGTTA
GATTTGCCCG ATTTTTCGCA AAATAGAACC GTTTTAGAAG TTCAAAATTT AACTCTGCCT
AACCAGTATG AATCCATTAA TTTCTCATTA AGAGAAGGAG AAATTATTGC CTTGACCGGT
TTACTCGGCT CAGGTCGGAC GGAACTGTGC CTTAGCTTAT TTGGAATCAC TCAACCTAAA
TCAGGTGATA TACTATTAAA TGGTGAGAAG GTTATATTTC AAAACAACCG TGATGCTATC
AAACAAGGAA TTGCTTATGT TTCTGAAGAT AGAATGACAA CTGGTTTAAT TATGACTGAA
TCCATACATC ATAATATTAT CTCTACTATT TTTCATAAAA TCACCGATAA ATTTAACATT
ATAAAATCAT CAAAAGCCTA TAATTATAGC CAGGAATTAA TTGAATCTTT AAAAATTAAA
GTAACGGATT CAGATTTGCC AGTAAATACA CTTTCCGGTG GGAATGCCCA GCGGGTGTCT
ATCGCAAAAT GGTTAGCAAT AGATCCTAGA ATTATTATTT TAGATGCTCC AACCATTGGG
GTAGATATTG CGAATAAGGA AGGAATATTC CAAATTATTC GCACATTAGC ACAAAAAGGT
ATCGCTGTTA TTTTTGTGAC CGATGAGGTA GAAGAAGCAT ACTACAACAG TCACAAAGTC
ATAGTAATGA AAAAAGGTAA AATTGTAGGT GAGATATTAC CTATCTATAC CACAGAAAAA
TCAATTGCGG AGGTTGTTTA TGAAAATCAC CAATAA
 
Protein sequence
MQNINKTFHG VKALNRVNLS LDYGEALCLA GQNGCGKSTL IKILSGVYQP DKGAEIQIGA 
SKYTKLTPQA SIEQGIQVIY QDLALFPNLT VAENIAINLH RKLGWVSQSE IHQVALNAIL
SINADLDLNA ILEDLPIAQQ QLVAICRALA QNARLLIMDE PTASLTAKEV QDLLKVVLKL
KSKGISIIFV SHKLQEVMSV SDTVLVLKNG NMVGQYPISE MDEKRLGFLM TGLEIDYKRL
DLPDFSQNRT VLEVQNLTLP NQYESINFSL REGEIIALTG LLGSGRTELC LSLFGITQPK
SGDILLNGEK VIFQNNRDAI KQGIAYVSED RMTTGLIMTE SIHHNIISTI FHKITDKFNI
IKSSKAYNYS QELIESLKIK VTDSDLPVNT LSGGNAQRVS IAKWLAIDPR IIILDAPTIG
VDIANKEGIF QIIRTLAQKG IAVIFVTDEV EEAYYNSHKV IVMKKGKIVG EILPIYTTEK
SIAEVVYENH Q