Gene HS_0446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0446 
Symbol 
ID4239922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp470536 
End bp471993 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content46% 
IMG OID638103988 
Producthypothetical protein 
Protein accessionYP_718655 
Protein GI113460591 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0309285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCA CTTTATCAGC CATTCTTCTT CTTTTTCCGG TTTCCGTACT GGCGCACAGT 
CCCAAAAGTC CGAGCGAACA TTTGGACGAT CACCGCATTG CAGATGAGCG GGTACGTGAA
AACATTCAAT ATGCCTTGGC GACACAACCT AAACAAACCG TTGTGCCGAA CATTCAGCCA
CAGCAAACGG TTGCATTAAG CGAAAGCCAG TTACAGCAAC ATCCTGATTT ACTTGAGCGT
GCATTGATAG CGGCTTTATT GCAAGGTAAT GGTGAGAATG CCTCTTTGCT GTTACCGCAC
TATCAAAAGT TGCCTGAAAA CCTGCAAGAC CCGACTTTTC ACCTTTGGGC AAAAGCCTTG
ATTGCGCGTT GGCGTCATCA ATATACACAG TCCGTGCGTT TATACCGCCA AGCCTTGGCA
CAACAACCTG ATTGGTCGGT TTTGCGTTTA CAGACCGCCG CCGCCCTCTT GTCCAATAAA
GAATTCGATG CGGCAGAAGC TCAGTTTCGC AAGGTGCAAA GCGAAAATCA GCTACCCGCC
GGACTTGCCC AAGAAATTGA ATCAGTTTTG CTGTATATCA AACGGCAAAG CCGTTGGCAA
TTCAGTGGCA ACACTACCTA TATTAACGAC AAAAATATTA ACAACGCTCC CAAAAATCCT
GATCTGGGCG GAGGTTGGCG AGGTGATCAG GCTGAGTCCG GTCAAGGGTT AGCCGTTAAT
TTGGGCACCA ATAAAAAATG GTTTTGGAAA AATGGATTAT TTAATGAATG GCGTTTAGAC
AGCAACAGTA AATTTTATTG GAACAACAAA CGATTCAACG AAGCCAATGT GCGTGCTTCA
ATGGGTATCG GTTATCAAAA TGCTAAAAAT AGCATTACCG TTCTGCCATT TTTTGAGCAA
GCGTGGTATG CAGGGGGCAA GAAAGGCAAT GAGACCTTAC GACGTTTTTC CAACAGTCGA
GGTATTGCAC TAGAAGCAAC CCACACTTTT AGCCCCAAAT GGCAGGGGAG TCTGACAGCT
GAGACGGCAC AAAATCGTTA TCGGACACGT AAGCATTTAA ACGGTAATAC GCACTTTGTT
TCTTTATCGG CGGTGTATCA ACACAATCCG AGTCAAGCTT GGTTTGGAGG AATAGACTGG
CATCGCAACA ACGCACGAGA TGGCGATGAT TCTTTTGACC GTATCGGGGT ACGGGCAGGC
TGGTTGCAAG ACTGGAAAGG ACTTTCCACA CGCTTAATTA CTTCTTATGG CAAAAAAAAC
TATCGCAGTG CAGGCTTTTT CAACAAAACC CAACGTAATC GAGAGTTGGG CGTACAGGTC
AGTGTATGGC ATCGAGCCGT ACACTGGCAA GGTTTAACAC CACGGTTAAC ATGGTCATAC
ACTAAAACGG ATAGTAACAT ACCATTGTTC CGTTACAACA AACAGCGCCT GTTTCTGGAA
ATTAATAAGC AGTTTTGA
 
Protein sequence
MKLTLSAILL LFPVSVLAHS PKSPSEHLDD HRIADERVRE NIQYALATQP KQTVVPNIQP 
QQTVALSESQ LQQHPDLLER ALIAALLQGN GENASLLLPH YQKLPENLQD PTFHLWAKAL
IARWRHQYTQ SVRLYRQALA QQPDWSVLRL QTAAALLSNK EFDAAEAQFR KVQSENQLPA
GLAQEIESVL LYIKRQSRWQ FSGNTTYIND KNINNAPKNP DLGGGWRGDQ AESGQGLAVN
LGTNKKWFWK NGLFNEWRLD SNSKFYWNNK RFNEANVRAS MGIGYQNAKN SITVLPFFEQ
AWYAGGKKGN ETLRRFSNSR GIALEATHTF SPKWQGSLTA ETAQNRYRTR KHLNGNTHFV
SLSAVYQHNP SQAWFGGIDW HRNNARDGDD SFDRIGVRAG WLQDWKGLST RLITSYGKKN
YRSAGFFNKT QRNRELGVQV SVWHRAVHWQ GLTPRLTWSY TKTDSNIPLF RYNKQRLFLE
INKQF