Gene HS_0586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0586 
SymbolxylF 
ID4240070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp623548 
End bp624549 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content37% 
IMG OID638104136 
ProductD-xylose transporter subunit XylF 
Protein accessionYP_718798 
Protein GI113460731 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID[TIGR02634] D-xylose ABC transporter, substrate-binding protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTA AATCTAAGTT ATTAGCGGTC GCAACAGCAA CTTTAATGGT TTTTAGCCAT 
TCAGTGCTAG CAAACGATCT GAAAATCGGT ATGTCAATTG ATGATTTACG TTTAGAAAGA
TGGCAAAAAG ACCGAGATAT TTTTGTGAAA AAAGCAGAAG CTTTAGGTGC AAAAGTATTC
GTTCAATCTG CAAATGGTGA TGCGACAGCT CAAATTTCTC AAATTGAGAA TATGCTAAAT
AAAGATATTG ATGTGCTAGT GATTATTCCA TTCAATGGCG AAGTATTGTC AAACGTGATC
GCTGAAGCCA AAAAAGAGGG GGTTAAAGTT TTAGCTTATG ACCGTCTGAT CAATAACGCA
GATATTGATT TCTATGTTTC GTTCGATAAT GAAAAAGTAG GTGAACTACA AGCACAAAGC
ATTATTGAGA AAAAACCGAA AGGGAATTAT TTCTTAATGG GCGGTTCACC TGTTGATAAT
AACGCAAAAT TATTTCGTAA AGGTCAAATG AAAGTATTAC AACCGCACAT TGACAGTGGT
GAAATCAACG TGGTAGGCGA TCAATGGGTT GATTCTTGGC TAGCTGAAAA AGCATTACAA
ATTATGGAAA ATGCGTTAAC TGCAAACAAA AACAATATTG ATGCGGTAGT CGCTTCTAAC
GATGCAACTG CCGGTGGTGC AATTCAAGCA TTAAGTGCTC AAGGCTTATC AGGCAAAGTA
GCGATTTCAG GTCAAGATGC TGATTTAGCG GCAATCAAAC GCATTCTTGA CGGTTCACAA
ACAATGACCG TATACAAACC AATCACTAAT TTAGCAGATA AAGCAGCCGA AATTTCAGTC
GCGTTAGGTA AAGGTGGAAA AGTGGAATCC AACTCTCAAT TAAATAACGG ATTGAAAAAT
GTCCCTGCAT TCCTATTAGA GCCTGTCGTC GTTACAAAAG AGAATATTGA TGACACGGTG
ATTAAAGATG GTTTCCATAC CAAAGAGGCT GTTTATAAAT AA
 
Protein sequence
MKLKSKLLAV ATATLMVFSH SVLANDLKIG MSIDDLRLER WQKDRDIFVK KAEALGAKVF 
VQSANGDATA QISQIENMLN KDIDVLVIIP FNGEVLSNVI AEAKKEGVKV LAYDRLINNA
DIDFYVSFDN EKVGELQAQS IIEKKPKGNY FLMGGSPVDN NAKLFRKGQM KVLQPHIDSG
EINVVGDQWV DSWLAEKALQ IMENALTANK NNIDAVVASN DATAGGAIQA LSAQGLSGKV
AISGQDADLA AIKRILDGSQ TMTVYKPITN LADKAAEISV ALGKGGKVES NSQLNNGLKN
VPAFLLEPVV VTKENIDDTV IKDGFHTKEA VYK