Gene HS_0430 details


Gene Information

Locus tag: HS_0430
Symbol: bcgIA
ID: 4239906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 457622
End bp: 459592
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 32%
IMG OID: 638103973
Product: restriction enzyme, alpha subunit
Protein accession: YP_718640
Protein GI: 113460576
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
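
The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the arithmetic, with hypothetical function names:

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residues encoded by a complete CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(457622, 459592)
print(length)                  # 1971
print(protein_length(length))  # 656
```

The gene sequence in this record ends in TAA, which is why one codon is subtracted to reach the 656 aa protein length.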


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.727595
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAGA AAGAAATAAT GACAGATCTT TGGGTATATG ACATGCTTAA AGAAATAGGT 
GTACAAAATG ATTTTTCAGC TCAAGGAAGC ACTATAAAAG AAATAGATGA AGCATTGGCA
ACTGCATCCA AGAAAGGTAC AGGGAATGTA GGTTTTCCAG AATATGTTGG AGTAATTAAA
GATTTTCTTG TAGTAATCGA AGATAAAGCA AGTTTAAATA AGCATGTTAA TAGAGATGAA
CACGATTTAA TTGCCGAGGA TGTTAAATCC ATTACAGATT ATGCAGTTAA TGGTGCATTG
TTTTATGGAA AACATTTAGC AAAAAATACA ACTTATAAAA AAATAATAGC GATCGGTGTA
AGTGGTAATG AGAAAAACCA TAGAATTTCA CCTTTGTTTG TTGATGAAAG AGGTGGATAT
AAGGAACTTC ATGATGTTGA AACATTTATA TCTTTCAGTG CTGATAATAT CAATGAGTAT
TACGAAAGAG AAATATTAGA AGTCAAAACT AATGATGAAT TAAAGACAGA AGAATTGTTG
AAAGTTGCTC GTTCACTTCA TGAAGATTTA AGAAATTACG GAAATTTAGA AGATAAAAAC
AAGCCGCTGA TTGTTTCTGG GATTTTACTT GCATTATCAG AAATTGAATA TAAAAACTTT
GATATCTCTG ATTTGATCGG AGATAAAATA AGAACGGATG GTTCAAAAAT ATATAAGGCA
ATAGAAGATA ATTTAAAAAG AGCAAATGTT AGCCCTGAAG TTAAAAGAGA CAAGCTTCTT
AACCAATTCA ATATCATAAA AGATAATAAC AAAATTAATG AAAAAAATTC TAATCTTGGG
AAAACACCAC TTAGATATTT TACAGAGGTT CTATATAACG GCATCTTCAC AAATATAAAA
TATAATTCAT CTACAGAAGA TTATATCGGT AGATTTTATG GTGAATTTAT GTCTTATTCT
GGAGGAGATG GACAGAGTTT AGGCATTATC TTAACACCGA GACATATAAC AGATTTGTTC
TGTGAATTGC TTGATATACA GCCAACAGAT AAGGTTTTAG ACCCTTGTTG TGGTACGGCA
GGATTCTTAA TTGCCGCCAT GCACCATATG CTTTCAAAAA CGGAAGATGA AAATGAACAG
ATAGAAATTA GAAAAAATAG ACTGTTTGGT ATTGAACTTC AAGATTATAT GTTTACGATA
GCAACAACAA ATATGATATT GCGTGGAGAT GGAAAGAGCA ATTTAGAAAA TCAAGATTTT
TTAGCACAAA ATCCAAGCAA GATACAACTT AAAGGCTGTA CAGTCGGAAT GATGAATCCA
CCATATTCTC AAGGTTCAAA ACAAAACTCT GAGTTATATG AGATAAACTT TGTAAATCAT
TTATTAGAAA GTTTAGTAGA AGGAGCTAAA GTTGCTGTTA TTGTGCCGCA ATCAACTTTC
ACAGGGAAAA CTAAGGATGA GCAAAACCTT AAGACTAAAA TATTAAAAAA ACATACGCTT
GAGGGTGTTA TCACGCTTAA TAAAAATACT TTTTATGGAG TAGGAACAAA CCCTTGTATC
GGTGTTTTTA CAGCAGGCAT ACCTCATAGC AAGACCAAGA AAGCTAAGTT TATAAATTTT
GAAAATGATG GCTATATCGT AAGTAAACAT ATAGGGCTAA TTGATGACGG AAGTGCAAAA
GATAAAAAGC AACATCTTCT TGATGTGTGG AATGAAGAAA TAGAAGCACC AACAAAATTT
TGTGTCTCTA CTACAGTTGA AGATACAGAT GAATGGTTGC ACTCTTTTTA TTATTTTAAT
GATGAAATTC CTAGTGATGA GGATTTTGAG AAAACTATAG CTGATTATTT GACTTTTGAA
GTCAACATGA TTACCCACGG CAGAGGATAT TTATTTGGAC TGAATAAAGA GGAAGACTTA
TCATCAGATG AAGTCCTAAA AGTAGCAGAG GATGGTGAAA ACTATGTATA A
 
Protein sequence
MAKKEIMTDL WVYDMLKEIG VQNDFSAQGS TIKEIDEALA TASKKGTGNV GFPEYVGVIK 
DFLVVIEDKA SLNKHVNRDE HDLIAEDVKS ITDYAVNGAL FYGKHLAKNT TYKKIIAIGV
SGNEKNHRIS PLFVDERGGY KELHDVETFI SFSADNINEY YEREILEVKT NDELKTEELL
KVARSLHEDL RNYGNLEDKN KPLIVSGILL ALSEIEYKNF DISDLIGDKI RTDGSKIYKA
IEDNLKRANV SPEVKRDKLL NQFNIIKDNN KINEKNSNLG KTPLRYFTEV LYNGIFTNIK
YNSSTEDYIG RFYGEFMSYS GGDGQSLGII LTPRHITDLF CELLDIQPTD KVLDPCCGTA
GFLIAAMHHM LSKTEDENEQ IEIRKNRLFG IELQDYMFTI ATTNMILRGD GKSNLENQDF
LAQNPSKIQL KGCTVGMMNP PYSQGSKQNS ELYEINFVNH LLESLVEGAK VAVIVPQSTF
TGKTKDEQNL KTKILKKHTL EGVITLNKNT FYGVGTNPCI GVFTAGIPHS KTKKAKFINF
ENDGYIVSKH IGLIDDGSAK DKKQHLLDVW NEEIEAPTKF CVSTTVEDTD EWLHSFYYFN
DEIPSDEDFE KTIADYLTFE VNMITHGRGY LFGLNKEEDL SSDEVLKVAE DGENYV
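
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The Python sketch below is one way to do that, not the method the database itself uses. It assumes the sequence is pasted as space-separated blocks as shown here, and the function names are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, so the standard assignments are used. Alternative start codons are not handled, which does not matter for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
fragment = "ATGGCTAAGA AAGAAATAAT GACAGATCTT"
print(translate(fragment))  # MAKKEIMTDL
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to gc_fraction gives a value that can be compared with the GC content field.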