Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0430 |
Symbol | bcgIA |
ID | 4239906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 457622 |
End bp | 459592 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638103973 |
Product | restriction enzyme, alpha subunit |
Protein accession | YP_718640 |
Protein GI | 113460576 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.727595 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAGA AAGAAATAAT GACAGATCTT TGGGTATATG ACATGCTTAA AGAAATAGGT GTACAAAATG ATTTTTCAGC TCAAGGAAGC ACTATAAAAG AAATAGATGA AGCATTGGCA ACTGCATCCA AGAAAGGTAC AGGGAATGTA GGTTTTCCAG AATATGTTGG AGTAATTAAA GATTTTCTTG TAGTAATCGA AGATAAAGCA AGTTTAAATA AGCATGTTAA TAGAGATGAA CACGATTTAA TTGCCGAGGA TGTTAAATCC ATTACAGATT ATGCAGTTAA TGGTGCATTG TTTTATGGAA AACATTTAGC AAAAAATACA ACTTATAAAA AAATAATAGC GATCGGTGTA AGTGGTAATG AGAAAAACCA TAGAATTTCA CCTTTGTTTG TTGATGAAAG AGGTGGATAT AAGGAACTTC ATGATGTTGA AACATTTATA TCTTTCAGTG CTGATAATAT CAATGAGTAT TACGAAAGAG AAATATTAGA AGTCAAAACT AATGATGAAT TAAAGACAGA AGAATTGTTG AAAGTTGCTC GTTCACTTCA TGAAGATTTA AGAAATTACG GAAATTTAGA AGATAAAAAC AAGCCGCTGA TTGTTTCTGG GATTTTACTT GCATTATCAG AAATTGAATA TAAAAACTTT GATATCTCTG ATTTGATCGG AGATAAAATA AGAACGGATG GTTCAAAAAT ATATAAGGCA ATAGAAGATA ATTTAAAAAG AGCAAATGTT AGCCCTGAAG TTAAAAGAGA CAAGCTTCTT AACCAATTCA ATATCATAAA AGATAATAAC AAAATTAATG AAAAAAATTC TAATCTTGGG AAAACACCAC TTAGATATTT TACAGAGGTT CTATATAACG GCATCTTCAC AAATATAAAA TATAATTCAT CTACAGAAGA TTATATCGGT AGATTTTATG GTGAATTTAT GTCTTATTCT GGAGGAGATG GACAGAGTTT AGGCATTATC TTAACACCGA GACATATAAC AGATTTGTTC TGTGAATTGC TTGATATACA GCCAACAGAT AAGGTTTTAG ACCCTTGTTG TGGTACGGCA GGATTCTTAA TTGCCGCCAT GCACCATATG CTTTCAAAAA CGGAAGATGA AAATGAACAG ATAGAAATTA GAAAAAATAG ACTGTTTGGT ATTGAACTTC AAGATTATAT GTTTACGATA GCAACAACAA ATATGATATT GCGTGGAGAT GGAAAGAGCA ATTTAGAAAA TCAAGATTTT TTAGCACAAA ATCCAAGCAA GATACAACTT AAAGGCTGTA CAGTCGGAAT GATGAATCCA CCATATTCTC AAGGTTCAAA ACAAAACTCT GAGTTATATG AGATAAACTT TGTAAATCAT TTATTAGAAA GTTTAGTAGA AGGAGCTAAA GTTGCTGTTA TTGTGCCGCA ATCAACTTTC ACAGGGAAAA CTAAGGATGA GCAAAACCTT AAGACTAAAA TATTAAAAAA ACATACGCTT GAGGGTGTTA TCACGCTTAA TAAAAATACT TTTTATGGAG TAGGAACAAA CCCTTGTATC GGTGTTTTTA CAGCAGGCAT ACCTCATAGC AAGACCAAGA AAGCTAAGTT TATAAATTTT GAAAATGATG GCTATATCGT AAGTAAACAT ATAGGGCTAA TTGATGACGG AAGTGCAAAA GATAAAAAGC AACATCTTCT TGATGTGTGG AATGAAGAAA TAGAAGCACC AACAAAATTT TGTGTCTCTA CTACAGTTGA AGATACAGAT GAATGGTTGC ACTCTTTTTA TTATTTTAAT GATGAAATTC CTAGTGATGA GGATTTTGAG AAAACTATAG CTGATTATTT GACTTTTGAA GTCAACATGA TTACCCACGG CAGAGGATAT TTATTTGGAC TGAATAAAGA GGAAGACTTA TCATCAGATG AAGTCCTAAA AGTAGCAGAG GATGGTGAAA ACTATGTATA A
|
Protein sequence | MAKKEIMTDL WVYDMLKEIG VQNDFSAQGS TIKEIDEALA TASKKGTGNV GFPEYVGVIK DFLVVIEDKA SLNKHVNRDE HDLIAEDVKS ITDYAVNGAL FYGKHLAKNT TYKKIIAIGV SGNEKNHRIS PLFVDERGGY KELHDVETFI SFSADNINEY YEREILEVKT NDELKTEELL KVARSLHEDL RNYGNLEDKN KPLIVSGILL ALSEIEYKNF DISDLIGDKI RTDGSKIYKA IEDNLKRANV SPEVKRDKLL NQFNIIKDNN KINEKNSNLG KTPLRYFTEV LYNGIFTNIK YNSSTEDYIG RFYGEFMSYS GGDGQSLGII LTPRHITDLF CELLDIQPTD KVLDPCCGTA GFLIAAMHHM LSKTEDENEQ IEIRKNRLFG IELQDYMFTI ATTNMILRGD GKSNLENQDF LAQNPSKIQL KGCTVGMMNP PYSQGSKQNS ELYEINFVNH LLESLVEGAK VAVIVPQSTF TGKTKDEQNL KTKILKKHTL EGVITLNKNT FYGVGTNPCI GVFTAGIPHS KTKKAKFINF ENDGYIVSKH IGLIDDGSAK DKKQHLLDVW NEEIEAPTKF CVSTTVEDTD EWLHSFYYFN DEIPSDEDFE KTIADYLTFE VNMITHGRGY LFGLNKEEDL SSDEVLKVAE DGENYV
|
| |