Gene HS_1493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1493 
Symboltex 
ID4241013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1684212 
End bp1686515 
Gene Length2304 bp 
Protein Length767 aa 
Translation table11 
GC content40% 
IMG OID638105074 
Producttranscription accessory protein 
Protein accessionYP_719703 
Protein GI113461634 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAATC AACAAATCGC CGGTATTATT GCGAAAGAAT TAGCTGTTCT GCCAAGTCAA 
ATTTTATCGG CTATCCAATT ATTAGATGAT GGCAATACAA TTCCTTTCAT TGCTCGTTAT
CGTAAAGAAA TGACCGGTGG ATTAGATGAT ACCCAATTAC GTCATTTTGA AACACGTTTA
ATTTATTTAC GTGAGTTAGA AGATCGTCGC CAAACGATTC TCAACTCTAT TGAAGAACAA
GGGAAATTGA CAGATGAATT GCGTAGTCAA ATTGAACAAA CGCAAAGTAA AACTGAATTA
GAAGATCTTT ATTTACCGTA TAAACCGAAA CGTCGCACAA AAGGGCAAAT CGCAATTGAG
GCAGGAATTG AACCTTTAGC TGATTTGCTT TGGAATGCAC CTGAGAATGA ACCGGAAATC
GTTGCGGCAG ATTATATCAA TGCGGAACAA GGGTTTGCGG ATATAAAATC TGTGCTTGAT
GGTGCTCGTT ATATTTTAAT GGAGCGTTTT GCCGAAGATG CACAATTGTT AGCGAAAATC
CGCCAATATT TACAGAAAAG TGCGGTACTG GTTTCCAATG TTTTAGAAGG CAAAGAGGCT
GAGGGAGAAA AGTTCCGAGA TTATTTTGAA CATCAGGAAT TACTCCGCAA TGTTCCTTCT
CATCGTGCTT TAGCTATGTT CCGAGGGCGT AATGAAGGCT TTTTGCAATT AAGGTTGAAT
GCGGATCCTG AGCAAGAAGA AGGTGTTCGT CATAGTTATT GTGAAGAAAT TATTCGAGAG
CATTTGGGTA TTCATTTAAC TCAGCAACCA GCGGATAAGT GGCGTGAGCA GGTTATCTCG
TGGACTTGGC GGATTAAAAT CTCTTTACAT CTTGAAACTG AACTGATGAG CAGTTTACGT
GAGAAAGCTG AAGATGAGGC AATTGATGTT TTTGCCCAAA ATCTAACTTC ATTATTAATG
GCAGCACCTG CCGGAGCGAA AAATACTATG GGGCTAGATC CCGGTTTAAG GACAGGTGTA
AAAGTCGCTA TTGTTGATAA TACGGGAAAA TTAGTGGCAA CAGAGACAGT TTATCCACAT
ATCGGACAAA TGAATACGGC AATGTCAGTT ATTTATCAAT TAATTAAGCA ACATAATGTT
GAGTTAATTG CCATTGGTAA CGGCACAGCT TCAAGAGAAA CTGAACGATT TGCTAAAGAT
GTTATTAAGA AAATTGAGCA AAATAAACCA CAAACAGTTG TGGTCAGTGA AGCAGGTGCA
TCTGTTTATT CCGCATCAGA ATTGGCAGCA CAAGAGTTTC CTGAACTTGA TGTGTCTTTG
CGAGGTGCAG TTTCTATTGC TCGCCGTTTA CAAGATCCAC TGGCGGAGTT AGTGAAAATT
GAACCGAAAG CAATTGGCGT TGGGCAATAT CAACATGATG TAAACCAAAT TCAGCTTGCT
CGTAAGCTAG ATGCGGTAGT AGAAGATTGT GTAAATGCCG TTGGGGTTGA TTTAAATACG
GCATCAGCAC CTTTACTTGC TAGAGTTGCA GGTATGACTA AACATCTTGC ACAAAATATT
GTGGCATATC GTGATGAAAA TGGGCGTTTT GAAAGCCGTA ATCAGTTAAA AAAAGTACCG
CGTTTGGGTC CCAAAGCCTT TGAACAATGT GCCGGTTTTA TGCGTATTGC ACAAGGTAAA
AATCCGCTTG ATGCCTCCGG TGTTCACCCT GAGGCTTATC CTGTTGTTGA AAAGATTTTA
CAGGCTACGG AAAAATCTAT TCAAGATTTA ATGGGAAATG TGAGTGCTGT TCGACAATTA
GATGCAAAAC AATTTACTGA TGAGCAATTT GGAATGCCAA CAGTTTTAGA CATTTTCAAA
GAGTTGGAAA AACCCGGAAG GGATCCTCGA GGTCAATTTA AAACGGCAGT ATTTATGGAC
GGTGTTGAAG AAATCACTGA CTTAAAAGCA GGTATGATTT TAGAAGGATC TGTAACTAAC
GTAACGAATT TTGGGGCATT TGTTGATATC GGTGTGCATC AAGACGGTTT AGTTCATATT
TCATCTCTTT CCGACAAATT TGTGGAGAAC CCTCATGAAG TTGTGAAAAC AGGCGATATT
GTGAAAGTCA AAGTGTTGGA AGTTGATGTT GCCCGTAAGC GTATTGGACT AACCATGCGT
TTAGATGAGA ATCAGCCCAA AACCGACCGC ACTTCAGTCA AAACGACCTC AGTCCATGTA
AAAGAAGTGA ATAGAAATCG GAAAAGTAGC AATAATGTTA TGGGAAATGC GTTTGCCGAT
GCGTTAAAAA ATTGGAAAAA ATAG
 
Protein sequence
MLNQQIAGII AKELAVLPSQ ILSAIQLLDD GNTIPFIARY RKEMTGGLDD TQLRHFETRL 
IYLRELEDRR QTILNSIEEQ GKLTDELRSQ IEQTQSKTEL EDLYLPYKPK RRTKGQIAIE
AGIEPLADLL WNAPENEPEI VAADYINAEQ GFADIKSVLD GARYILMERF AEDAQLLAKI
RQYLQKSAVL VSNVLEGKEA EGEKFRDYFE HQELLRNVPS HRALAMFRGR NEGFLQLRLN
ADPEQEEGVR HSYCEEIIRE HLGIHLTQQP ADKWREQVIS WTWRIKISLH LETELMSSLR
EKAEDEAIDV FAQNLTSLLM AAPAGAKNTM GLDPGLRTGV KVAIVDNTGK LVATETVYPH
IGQMNTAMSV IYQLIKQHNV ELIAIGNGTA SRETERFAKD VIKKIEQNKP QTVVVSEAGA
SVYSASELAA QEFPELDVSL RGAVSIARRL QDPLAELVKI EPKAIGVGQY QHDVNQIQLA
RKLDAVVEDC VNAVGVDLNT ASAPLLARVA GMTKHLAQNI VAYRDENGRF ESRNQLKKVP
RLGPKAFEQC AGFMRIAQGK NPLDASGVHP EAYPVVEKIL QATEKSIQDL MGNVSAVRQL
DAKQFTDEQF GMPTVLDIFK ELEKPGRDPR GQFKTAVFMD GVEEITDLKA GMILEGSVTN
VTNFGAFVDI GVHQDGLVHI SSLSDKFVEN PHEVVKTGDI VKVKVLEVDV ARKRIGLTMR
LDENQPKTDR TSVKTTSVHV KEVNRNRKSS NNVMGNAFAD ALKNWKK