Gene HS_1686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1686 
SymbolapbE 
ID4241213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1912526 
End bp1913575 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content35% 
IMG OID638105272 
Productthiamine biosynthesis lipoprotein 
Protein accessionYP_719891 
Protein GI113461822 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAGA AATTATTATT AAATTTAATA AATATATTTG CGTTGGTATT TTTACTTAGT 
GCTTGCCAAA AAGAAGCTGA GTTAGTGTCA TTAAATGGTA GAACCATGGG TACAACTTAC
CATATCAAAT ATATTGATGA GGGCAAGACT AAGTTAAGTG TACAAAAAAT GCACGAAGGC
ATTGAAGGTA TCTTACAAGA TGTAAATGCT AAAATGTCCA CTTATATTCC TAATTCAGAG
TTAAGTGTGT TCAACAAAAA CAAGGAGATA AATAATCCCA TTGAAATTTC CGCAGATTTG
GCTTTTGTAG TTGCTGAAGC AATAAAGTTA AATCAAATTA CTCAAGGTGC TCTAGATGTA
ACAGTTGGTC CTATTGTGAA CTTATGGGGT TTTGGACCGG AAAAACGGGT AGAAAAAGCA
CCCACACCGG AACAAATAGC TGAACGAAAA GCCTGGGTAG GTATTGAGAA AGTTAGACTA
ACACAAAAAG ACAATAAATT CTTTTTGACC AAATCTGTGC CGCAGATTTA TATTGATTTA
TCTTCTATTG CTAAAGGTTT TGGTGTCGAT AAAGTTGCTG ATTATATTGC TGAGCAAGGT
ATTACTGACT ACTTAGTGGA AATTGGCGGT GAGATTCGAG CAAATGGTCA TAATGCTGAA
AATAAAGCTT GGCAAATAGC TATTGAAAAG CCAACCTTTG ATGGAACTCG ATCTGTATCA
CAAGTTGTCG GTTTACAAGA TTTGGCTATG GCAACTTCCG GGGATTATCG CAATTATTTT
GAGCAAGATG GAAAACGTTT TTCCCATGAA ATAGATCCTA CAACTTGCCA GCCCGTTCAG
CATAATTTAG CCTCAATTAC AGTCTTATCT AAAAGTGCTA TGACTGCAGA CGGCTTATCC
ACAGGTTTAT TTGTTTTAGG TGCGGAAAAA GCACTGGAAA TTGCTGAGCA AAATGATTTA
CCTATTTATT TAACGGTCAA AACTCCACAA GGGTTTGAAA ATAAAATGTC CTCTAAATTT
GCTGAAATAT TATCAACTCA GAAAAAATAA
 
Protein sequence
MTKKLLLNLI NIFALVFLLS ACQKEAELVS LNGRTMGTTY HIKYIDEGKT KLSVQKMHEG 
IEGILQDVNA KMSTYIPNSE LSVFNKNKEI NNPIEISADL AFVVAEAIKL NQITQGALDV
TVGPIVNLWG FGPEKRVEKA PTPEQIAERK AWVGIEKVRL TQKDNKFFLT KSVPQIYIDL
SSIAKGFGVD KVADYIAEQG ITDYLVEIGG EIRANGHNAE NKAWQIAIEK PTFDGTRSVS
QVVGLQDLAM ATSGDYRNYF EQDGKRFSHE IDPTTCQPVQ HNLASITVLS KSAMTADGLS
TGLFVLGAEK ALEIAEQNDL PIYLTVKTPQ GFENKMSSKF AEILSTQKK