Gene HS_1488 details


Gene Information

Locus tag: HS_1488
Symbol:
ID: 4241008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1679288
End bp: 1680328
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 38%
IMG OID: 638105069
Product: hypothetical protein
Protein accession: YP_719698
Protein GI: 113461629
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
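
The coordinate and length fields in the table above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names span_length and expected_protein_length are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1679288
END_BP = 1680328
GENE_LENGTH_BP = 1041
PROTEIN_LENGTH_AA = 346


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1680328 - 1679288 + 1 gives 1041 bp, and 1041 / 3 - 1 gives 346 aa, matching the listed gene and protein lengths.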


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0806646
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAA TTTTATTCAT TGTATTGTTG TTTTTGTGTG GTGCAGGCGG TAGCGTTTTT 
TGGGCATATT GGCAAATAAC TGACTTTGTA AAACAACCTG TTAAAGTCAA AGAAGAGCAA
CTTTTAACTG TTGTGCGAGG AACGACCGGC AATAAATTGG CAATATTATT AGAAAATGAA
GGGTTAATCG AAAATGGGAA ATGGTTGCCT TGGCTGCTTA AATTAAAACC CGAATTGAAT
AAAATTAAAG CCGGTACTTA TTCCCTCGTT AATGTAGAAA ATATTCGAGA TCTTCTTGAT
GTACTTAATC AGGGCAAAGA GGTGCAATTT AATTTGCAAT TGATTGAAGG GCAACGTTTT
AAAACTTGGC GTAAAATTTT AGAAAATGCA CCGCACTTAC GGCAAACATT ACAAGGAAAA
TCGGAGAAAG AGATTTTTAC TTTGCTGGAG TTGCCGGCTT ATTCAAAAGC TGTTTATGAA
TGGAAAACGA TTGATGGTTG GTTATATCCG GATACTTATA GTTACACGCC TAACTCTAGC
GATTTGGCAC TGTTAAAACG TGCGGCTTCC CGTACCATAA AAGCGTTGGA GCGAGCGTGG
CAACAAAGAA ATGTAAATTT GCCATTGAAA AATCCCTATG AAATGTTAAT TCTTGCTTCT
ATTGTGGAAA AGGAAACAGC ATTGACTGAG GAGAGAGCGA AAGTAGCGGG CGTTTTTGTG
AATCGTTTAA ATAAGCAAAT GAAATTACAA ACAGATCCAA CGGTGATCTA TGGTATGGGT
GATAATTATA AAGGTAATAT TCGGAAAAAA GATTTATTGA CACCAACCCC TTATAATACC
TATGTGATTG ATGGTTTACC GCCGACGCCG ATTGCTATGG TAAGCGAGGA AAGTTTACAG
GCTGTTGCTA AACCGGAACA GCATGATTAT TTATATTTTG TCGCAGATGG AAGCGGTGGA
CACAAGTTTA GTAAAACATT GGCAGAACAT AACCGTGCTG TGCAAGAATA TTTGCGTTGG
TACCGTTCTC AATCAAAATA G
 
Protein sequence
MKKILFIVLL FLCGAGGSVF WAYWQITDFV KQPVKVKEEQ LLTVVRGTTG NKLAILLENE 
GLIENGKWLP WLLKLKPELN KIKAGTYSLV NVENIRDLLD VLNQGKEVQF NLQLIEGQRF
KTWRKILENA PHLRQTLQGK SEKEIFTLLE LPAYSKAVYE WKTIDGWLYP DTYSYTPNSS
DLALLKRAAS RTIKALERAW QQRNVNLPLK NPYEMLILAS IVEKETALTE ERAKVAGVFV
NRLNKQMKLQ TDPTVIYGMG DNYKGNIRKK DLLTPTPYNT YVIDGLPPTP IAMVSEESLQ
AVAKPEQHDY LYFVADGSGG HKFSKTLAEH NRAVQEYLRW YRSQSK
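
The protein sequence can be cross-checked against the gene sequence by translating the DNA codon by codon, and the GC content can be recomputed from the same string. The following is a minimal sketch, not the method used to produce this record. It assumes the sequence is pasted as plain text in space-separated blocks, as shown above, and it uses the standard codon assignments, which translation table 11 shares with table 1 for elongation codons. The helper names clean, gc_content, and translate are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence in frame 1 up to the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    Alternative start codons (e.g. GTG, TTG) are not special-cased; this
    gene starts with ATG, so that does not matter here.
    """
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence listed above.
    first_line = clean(
        "ATGAAGAAAA TTTTATTCAT TGTATTGTTG TTTTTGTGTG GTGCAGGCGG TAGCGTTTTT"
    )
    print(translate(first_line))
    print(round(gc_content(first_line), 1))
```

Pasting the full gene sequence into clean() and passing the result to translate() should give a string that can be compared directly with the protein sequence once its spaces are removed in the same way.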