Gene Information |
Locus tag | HS_1493 |
Symbol | tex |
ID | 4241013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1684212 |
End bp | 1686515 |
Gene Length | 2304 bp |
Protein Length | 767 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638105074 |
Product | transcription accessory protein |
Protein accession | YP_719703 |
Protein GI | 113461634 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
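The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1684212
end_bp = 1686515
gene_length_bp = 2304
protein_length_aa = 767


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 1686515 - 1684212 + 1 gives 2304 bp, and 2304 / 3 - 1 gives 767 aa, which agrees with the Gene Length and Protein Length fields.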
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAATC AACAAATCGC CGGTATTATT GCGAAAGAAT TAGCTGTTCT GCCAAGTCAA ATTTTATCGG CTATCCAATT ATTAGATGAT GGCAATACAA TTCCTTTCAT TGCTCGTTAT CGTAAAGAAA TGACCGGTGG ATTAGATGAT ACCCAATTAC GTCATTTTGA AACACGTTTA ATTTATTTAC GTGAGTTAGA AGATCGTCGC CAAACGATTC TCAACTCTAT TGAAGAACAA GGGAAATTGA CAGATGAATT GCGTAGTCAA ATTGAACAAA CGCAAAGTAA AACTGAATTA GAAGATCTTT ATTTACCGTA TAAACCGAAA CGTCGCACAA AAGGGCAAAT CGCAATTGAG GCAGGAATTG AACCTTTAGC TGATTTGCTT TGGAATGCAC CTGAGAATGA ACCGGAAATC GTTGCGGCAG ATTATATCAA TGCGGAACAA GGGTTTGCGG ATATAAAATC TGTGCTTGAT GGTGCTCGTT ATATTTTAAT GGAGCGTTTT GCCGAAGATG CACAATTGTT AGCGAAAATC CGCCAATATT TACAGAAAAG TGCGGTACTG GTTTCCAATG TTTTAGAAGG CAAAGAGGCT GAGGGAGAAA AGTTCCGAGA TTATTTTGAA CATCAGGAAT TACTCCGCAA TGTTCCTTCT CATCGTGCTT TAGCTATGTT CCGAGGGCGT AATGAAGGCT TTTTGCAATT AAGGTTGAAT GCGGATCCTG AGCAAGAAGA AGGTGTTCGT CATAGTTATT GTGAAGAAAT TATTCGAGAG CATTTGGGTA TTCATTTAAC TCAGCAACCA GCGGATAAGT GGCGTGAGCA GGTTATCTCG TGGACTTGGC GGATTAAAAT CTCTTTACAT CTTGAAACTG AACTGATGAG CAGTTTACGT GAGAAAGCTG AAGATGAGGC AATTGATGTT TTTGCCCAAA ATCTAACTTC ATTATTAATG GCAGCACCTG CCGGAGCGAA AAATACTATG GGGCTAGATC CCGGTTTAAG GACAGGTGTA AAAGTCGCTA TTGTTGATAA TACGGGAAAA TTAGTGGCAA CAGAGACAGT TTATCCACAT ATCGGACAAA TGAATACGGC AATGTCAGTT ATTTATCAAT TAATTAAGCA ACATAATGTT GAGTTAATTG CCATTGGTAA CGGCACAGCT TCAAGAGAAA CTGAACGATT TGCTAAAGAT GTTATTAAGA AAATTGAGCA AAATAAACCA CAAACAGTTG TGGTCAGTGA AGCAGGTGCA TCTGTTTATT CCGCATCAGA ATTGGCAGCA CAAGAGTTTC CTGAACTTGA TGTGTCTTTG CGAGGTGCAG TTTCTATTGC TCGCCGTTTA CAAGATCCAC TGGCGGAGTT AGTGAAAATT GAACCGAAAG CAATTGGCGT TGGGCAATAT CAACATGATG TAAACCAAAT TCAGCTTGCT CGTAAGCTAG ATGCGGTAGT AGAAGATTGT GTAAATGCCG TTGGGGTTGA TTTAAATACG GCATCAGCAC CTTTACTTGC TAGAGTTGCA GGTATGACTA AACATCTTGC ACAAAATATT GTGGCATATC GTGATGAAAA TGGGCGTTTT GAAAGCCGTA ATCAGTTAAA AAAAGTACCG CGTTTGGGTC CCAAAGCCTT TGAACAATGT GCCGGTTTTA TGCGTATTGC ACAAGGTAAA AATCCGCTTG ATGCCTCCGG TGTTCACCCT GAGGCTTATC CTGTTGTTGA AAAGATTTTA CAGGCTACGG AAAAATCTAT TCAAGATTTA ATGGGAAATG TGAGTGCTGT TCGACAATTA 
GATGCAAAAC AATTTACTGA TGAGCAATTT GGAATGCCAA CAGTTTTAGA CATTTTCAAA GAGTTGGAAA AACCCGGAAG GGATCCTCGA GGTCAATTTA AAACGGCAGT ATTTATGGAC GGTGTTGAAG AAATCACTGA CTTAAAAGCA GGTATGATTT TAGAAGGATC TGTAACTAAC GTAACGAATT TTGGGGCATT TGTTGATATC GGTGTGCATC AAGACGGTTT AGTTCATATT TCATCTCTTT CCGACAAATT TGTGGAGAAC CCTCATGAAG TTGTGAAAAC AGGCGATATT GTGAAAGTCA AAGTGTTGGA AGTTGATGTT GCCCGTAAGC GTATTGGACT AACCATGCGT TTAGATGAGA ATCAGCCCAA AACCGACCGC ACTTCAGTCA AAACGACCTC AGTCCATGTA AAAGAAGTGA ATAGAAATCG GAAAAGTAGC AATAATGTTA TGGGAAATGC GTTTGCCGAT GCGTTAAAAA ATTGGAAAAA ATAG
|
Protein sequence | MLNQQIAGII AKELAVLPSQ ILSAIQLLDD GNTIPFIARY RKEMTGGLDD TQLRHFETRL IYLRELEDRR QTILNSIEEQ GKLTDELRSQ IEQTQSKTEL EDLYLPYKPK RRTKGQIAIE AGIEPLADLL WNAPENEPEI VAADYINAEQ GFADIKSVLD GARYILMERF AEDAQLLAKI RQYLQKSAVL VSNVLEGKEA EGEKFRDYFE HQELLRNVPS HRALAMFRGR NEGFLQLRLN ADPEQEEGVR HSYCEEIIRE HLGIHLTQQP ADKWREQVIS WTWRIKISLH LETELMSSLR EKAEDEAIDV FAQNLTSLLM AAPAGAKNTM GLDPGLRTGV KVAIVDNTGK LVATETVYPH IGQMNTAMSV IYQLIKQHNV ELIAIGNGTA SRETERFAKD VIKKIEQNKP QTVVVSEAGA SVYSASELAA QEFPELDVSL RGAVSIARRL QDPLAELVKI EPKAIGVGQY QHDVNQIQLA RKLDAVVEDC VNAVGVDLNT ASAPLLARVA GMTKHLAQNI VAYRDENGRF ESRNQLKKVP RLGPKAFEQC AGFMRIAQGK NPLDASGVHP EAYPVVEKIL QATEKSIQDL MGNVSAVRQL DAKQFTDEQF GMPTVLDIFK ELEKPGRDPR GQFKTAVFMD GVEEITDLKA GMILEGSVTN VTNFGAFVDI GVHQDGLVHI SSLSDKFVEN PHEVVKTGDI VKVKVLEVDV ARKRIGLTMR LDENQPKTDR TSVKTTSVHV KEVNRNRKSS NNVMGNAFAD ALKNWKK
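The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how one might strip the grouping, compute a GC fraction to compare with the GC content field, and check that a sequence looks like a complete CDS. All function names are hypothetical, and the start codon set is a simplifying assumption: translation table 11 permits more start codons than the three listed here.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Check length, start codon and stop codon of a candidate CDS."""
    starts = {"ATG", "GTG", "TTG"}  # assumption: common bacterial starts only
    stops = {"TAA", "TAG", "TGA"}   # stop codons in translation table 11
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# First two blocks of the gene sequence as printed in this record.
prefix = clean_sequence("ATGTTAAATC AACAAATCGC")
print(len(prefix), gc_fraction(prefix))
```

To apply this to the full record, pass the complete Gene sequence field to `clean_sequence` and compare the result of `gc_fraction` with the reported 40%.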
|
| |