Gene HS_1586 details


Gene Information

Locus tag: HS_1586
Symbol:
ID: 4241113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1800815
End bp: 1802200
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 37%
IMG OID: 638105172
Product: major facilitator superfamily permease
Protein accession: YP_719791
Protein GI: 113461722
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
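
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1800815, 1802200)
print(length)                     # 1386
print(protein_length_aa(length))  # 461
```

Both values agree with the Gene Length and Protein Length fields of this record.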


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000240166
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGCT TTTTGCAAAA ATATGGGTTT ACCCCAGCCT CATTTTTTCA ACTCATTCTG 
ATCACTGTTA ATGCACAATT GATTTATGCT TTTTGGGATA TTCGTAATAG CGTACCTGGT
GGTTTTCCTG CCGCTTTAGG GGTAACGGAT CAACAAGCCG GTTATTTATA TTCTATGCAA
GGACTTGTAA TTATTTTAGG GACTATTGCT TTAGGTTGGG TTGGTGACCG TTTTTCAATT
CGTTCTATTA TGTTGCTATC TACGGTTGGT GTGGGTGGAA TTTCTTTATT TCTAACCCTC
TCATCTCCAG GACTTAGCAT GCCTGTGTTA CTGGCTTGTT TCTTCTCTAT GTTATTTTTT
AGTGAGGTAT TATTTAAACC GGCTAATTTC AAAGCATTAA GAATTTCAAC CACGGAAAAA
CATCAAGGTA TGGTGTTTGG ATTATTTGAG TTTGGTCGTG GGTTGCTTGC TTTCCTTATC
TCCTTGTTAT GGACGGTGAT GCTTTATTAT AAAGTCGGTC CGAAGGCAAT GATGATGACA
AGTTGTATTA TTGTTATTAT TACTGGTATT GCAGTGTTTT TTATTGTACC TAAAGATCAA
AAAGTCGGAG ATGAAGATAC TCAAGTTAAT ACGACCAAAG AAGCTATTCA GGGTGTTGCT
CGTGTAGCTA AATTACCGGT TGTTTGGATT GCCGGAATTA ATGTGTTCTG TATTTATGGT
GCGTTTGTTG CCGCTGGGAC CTATTTTGCC CGTTTTTTAC AAGGTGGATA TGGTACAAGT
GCGGTTGCTG CGGCAGTTTT TGCAACAGTA GTTATTGCAT TGAGAATGTT ACCTTTGGTT
TCTTCCGTTT TAGTAGAAAA AGTCTTTGCT TCTACCGCAC ACTTCATGCG AATGATGCAA
ATTATTTTAG TGGTTATTCT TTCTGTAATT GGAATTATCT TTTTTACCAA TCATCCAGAT
ATTTCTTTAT ATGCTGATGG TTATATTCCA GATAATACCC CAGTAGGACT CATTTCTTCC
AGTATGTTCT GGACACTGGT AGTTCTTATG TTATGTGCAT CAGCTTGTAT CTTCATGATT
CGAGGTGTTT ATTATGCCCC AATCGGTGAG ATGGGGGTTG ATAAAAAGCA TTCCTCAGCA
GCAATGTCTT TCGCCATTAC TATTGGCTAT TTTCCTGCTT TATTAGCACC AATTGTATTA
GGTGGCTTGG TTAAATCACC GGCAAAAGAT GCTACAGGAC AAATTATCCG CTCTTATTTA
ACTGATACGC AAGTGTTAGC TTGTGCTTTC TTTGGGCTTG CGATTCTTGC GTTAATTTCT
GTTTTTATGT CACATACTTT AATTAAAATG AAACAGAAAC AGCAATTAAA AACTAGTAAT
CAATAG
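
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a rough sketch of how a GC content figure could be derived from such a listing, the hypothetical helper below strips the whitespace and counts G and C bases. How the source database computes or rounds its own value is not stated, so this is only one plausible method.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the listing above, as a small demonstration.
print(gc_percent("ATGAAAAGCT TTTTGCAAAA"))  # 25.0
```

Passing the full listing as one string would give the figure for the whole gene.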
 
Protein sequence
MKSFLQKYGF TPASFFQLIL ITVNAQLIYA FWDIRNSVPG GFPAALGVTD QQAGYLYSMQ 
GLVIILGTIA LGWVGDRFSI RSIMLLSTVG VGGISLFLTL SSPGLSMPVL LACFFSMLFF
SEVLFKPANF KALRISTTEK HQGMVFGLFE FGRGLLAFLI SLLWTVMLYY KVGPKAMMMT
SCIIVIITGI AVFFIVPKDQ KVGDEDTQVN TTKEAIQGVA RVAKLPVVWI AGINVFCIYG
AFVAAGTYFA RFLQGGYGTS AVAAAVFATV VIALRMLPLV SSVLVEKVFA STAHFMRMMQ
IILVVILSVI GIIFFTNHPD ISLYADGYIP DNTPVGLISS SMFWTLVVLM LCASACIFMI
RGVYYAPIGE MGVDKKHSSA AMSFAITIGY FPALLAPIVL GGLVKSPAKD ATGQIIRSYL
TDTQVLACAF FGLAILALIS VFMSHTLIKM KQKQQLKTSN Q
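
The protein sequence can be checked against the gene sequence by translating the CDS codon by codon. The sketch below builds the codon table in NCBI base order (T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. This simplified version does not handle alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("ATGAAAAGCT TTTTGCAAAA ATATGGGTTT"))  # MKSFLQKYGF
```

The output matches the first block of the protein sequence listed above.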