Gene HS_1601 details

Gene Information

Locus tag: HS_1601
Symbol: uidB
ID: 4241128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1817191
End bp: 1818606
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 36%
IMG OID: 638105187
Product: glucuronide permease
Protein accession: YP_719806
Protein GI: 113461737
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID: [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter
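
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 1817191
end_bp = 1818606

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Every codon encodes one residue except the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1416 bp
print(protein_length, "aa")  # 471 aa
```

Both values agree with the Gene Length and Protein Length fields in the record.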


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCAA CACAAAGACC TTTTGGTTTA AAAGACAAAC TTGCCTATAT GGCTGGCGAT 
ATTGCTAATG ATCTCAGTTT TATGATGTCC GCCTTCTTTT TAATGCTATT TTATACTAAT
GTGCTACAAA TTGAAGGTTA TGTTGTCGGT CTTCTATTTC TAGTTTCTAG ATTTATTGAC
GCTTTTACTG ATATTGGTAT GGGACGTTTA GTAGATACTA TAAAGCCTTT TAAAGAGGGG
CGTTTCAGAG GTCTTATTCG CCGGGCAACT CCCTTTATCT GTATTTCAGG ATTTCTGCTT
TTCTTGCACA TTGTGAAAGA TTGGTCTTAT ACCGCAAAAT TGGTCTACAT CACTGTTACC
TACATAGTTT GGGGTAGTTT AGCCTATACT GCGGTCAATA TTCCTTATGG TTCAATGGCT
TCCGTGATTA CTACAAAAGC TGATGAGCGT GCCGGATTAT CTATCTTCCG TACAGTGGGT
GCAAATATTG CGGTACTTTT TATCTCATTT GTTATTCCAC TCATTATTTA CAAAGAAGTT
GAAGGTAAAC AAGTAATTAT CCCAGAAATG TTCACATACA TTATGGGAGC ATTTATGATC
TGTGCGTTTA TTTTGTACCA AATCTGTTGG AGATTTTCGG TTGAGCGAGT TCAATTGCCT
GAACAAGAAG GTCGTCGCCA CAACAAATCA CACAATAAAT CAGACTGTTT AGATGACGTA
AAAGCAATCT TTAGCAGTCT ATTCTCAAAC AGTGCATTAC TTATCTTTAT TTTAATCGCT
ATCATTTTAC TGCTAGCAAC CTTAATCATC GGTACAATGA ACCCATATTT GTATGTTGAT
TACTTTAATA GTAAATTAGC ATTGTCATTT GGTGGTATTT TAGGCGCCGT GACAACATTT
ATGGTTGCTC CGTTTGCCCA AAATATTGTG AAAAAGTACG GTAAAAAAGA ATCAGCTTCT
GTAGGTTTAT TGATAACTGC TGTCATATAC AGTGTGCTAT TCTTCGTTAA AATCACAAAT
GTTTGGTTAT TTATCATTGT TGCACTCATT GCAACACTGG GTCTAAGTTA TTTCCAAATT
ATTATATGGG CATTTATTAC AGATATTATT GATAACCAAT TTATCAAAAC TGGACGCCGT
GAAGATGGCA CAATTTATGC AGTTTATTCA TTTGCTCGCA AAATCGGTCA AGCTTTAGCT
GGTGGTTTAG GTGGCTTTGC ACTAAGTTAC ATTGGCTATT CTGCAAAAAT CCCACAACAA
CCACAAGAAG TGTTAGAGTC AATCTATAAC TTTGCAACGG GAGTACCTGC ATTGGCTTGC
ATCTTGATTT TTTTACTGTT GAAATACGTT TACCCGCTTT CTAAAGAAAA AGTGGACGAA
AACGCAAGTA TATTAGAACA AAAAATAACT CAATAA
 
Protein sequence
MSSTQRPFGL KDKLAYMAGD IANDLSFMMS AFFLMLFYTN VLQIEGYVVG LLFLVSRFID 
AFTDIGMGRL VDTIKPFKEG RFRGLIRRAT PFICISGFLL FLHIVKDWSY TAKLVYITVT
YIVWGSLAYT AVNIPYGSMA SVITTKADER AGLSIFRTVG ANIAVLFISF VIPLIIYKEV
EGKQVIIPEM FTYIMGAFMI CAFILYQICW RFSVERVQLP EQEGRRHNKS HNKSDCLDDV
KAIFSSLFSN SALLIFILIA IILLLATLII GTMNPYLYVD YFNSKLALSF GGILGAVTTF
MVAPFAQNIV KKYGKKESAS VGLLITAVIY SVLFFVKITN VWLFIIVALI ATLGLSYFQI
IIWAFITDII DNQFIKTGRR EDGTIYAVYS FARKIGQALA GGLGGFALSY IGYSAKIPQQ
PQEVLESIYN FATGVPALAC ILIFLLLKYV YPLSKEKVDE NASILEQKIT Q
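
The protein sequence is the conceptual translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, not the tool the database used; `translate`, `CODON_TABLE` and `first_block` are hypothetical names introduced for illustration. Table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons. Because this gene starts with ATG, a plain codon lookup is enough here.

```python
from itertools import product

# Codon order follows the NCBI convention: bases cycle T, C, A, G,
# first position slowest, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    Alternative start codons are not handled specially (assumption: ATG start).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_block = "ATGAGTTCAA CACAAAGACC TTTTGGTTTA"
print(translate(first_block))  # MSSTQRPFGL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should give the complete 471-residue protein, since the final codon TAA is a stop.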