Gene HS_1022 details


Gene Information

Locus tag: HS_1022
Symbol: rec2
ID: 4240520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1128741
End bp: 1131167
Gene Length: 2427 bp
Protein Length: 808 aa
Translation table: 11
GC content: 35%
IMG OID: 638104583
Product: recombination protein 2
Protein accession: YP_719234
Protein GI: 113461165
COG category: [R] General function prediction only
COG ID: [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
            [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
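
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `protein_length_from_cds` are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 1128741
end_bp = 1131167
reported_gene_length = 2427      # bp
reported_protein_length = 808    # aa


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length_from_cds(cds_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = protein_length_from_cds(gene_length)

print(gene_length == reported_gene_length)        # coordinates agree with Gene Length
print(protein_length == reported_protein_length)  # CDS length agrees with Protein Length
```

Under those assumptions both comparisons hold: 1131167 - 1128741 + 1 = 2427 bp, and 2427 / 3 - 1 = 808 aa.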


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.949854
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTCA ACCTTGATCA TTACCTTTTT GTTATTTTAG CGAGTGCAAT GACATTGCTA 
ATTACACCTC GTATTTTTTT ATTGAATTGG CAATGGATAT TGATCTTATG TCTTGTTCTT
GGGTTAGGAT ATATTGGTGT CAAAGCAGTA TTTTTTCGCT GTGTTCTAAA GCTATTTTTC
ATTTTTGTTC TTGGGGTGGG ATTTTTTCAT TATCAAGCAT TAAACTTGTT GGAACAAAGT
GAGCATATTA CCCGCTTACC TAAAAAAGTA CAAACTAACG TTAAAATAGC TGAAATTATT
CAGCAGAAAG ACTATCAAGC AGTGATCGCA GAAGGAAATT TCACCTCATC TTTGCCAACA
CAGCGAATTT ATTTAAATTG GCGTACTGAG CAAGAAGTTC AAGTTGGTGA AATTTGGCAA
GTAAATATGC ACATTCGTCC GATTTCTTCT CGATTAAATA TAGGAGGATT TGATCGTCAA
ACCTGGTATT TAGCAAAAGA AATTACAGCT TATGCCACAG TAAAAAGTGC GGTGAAAATT
GGTGAGGATT TTTCTTGGCG AGCTACTCGA CTGAATCAAG CTTTACAACA AACTTCCAAT
TTAGTTTCAC AAGGTTTATT GCTTGCATTG GGTTTTGGTG AAAGAGCCTG GCTAGACAAG
GAATTATGGC AACTTTATCA ACAAACCAAT ACCGCACATT TAATTGCAAT TTCAGGGTTG
CATATTGGTT TAGCTATGTT CATCGGTTTT ACATTAGGGC GAGCTATTCA ATTATTATTC
CCTACTCGTT ATATTGAACC TTATTTTCCT TTAATATTAG GGATTTTTTT AGCATTCTGT
TATGCAGAAT TAGCGGGATT TTCAATTCCT ACTTTTCGTG CGATTGTCGC TTTATTTATA
GTTTGTTTGT GTCGTATATG TCGAGTTTCC TATAATGTTT GGCAACTATT TTTGCGGGTT
ATTTGTGTTT TACTGATACT TAATCCATTT ATGTTATTGT CAGCCAGTTT TTGGCTATCA
ATCGGAGCGG TAGGTTGTTT GATCATGTGG TATCAATGGA TACCGCTAAA TTTATTTTTA
TGGAAAGAAA AGCCTTTGGC ACAATCTTCC TTGAAAAAAG TGCGGTATTT TATCGGTCTT
TTTCATTTGC AATTTGGGCT TTTGTGGTTT TTTACACCTA TCCAACTCTT AATTTTTAAT
GGTATCGCCT TAAATGGATT TTGGGCTAAT TTATTATTGT TGCCTTTGTT CAGTTTTTTG
TTAGTTCCAC TTATTCTTTT TGCTGTACTG ACGGAAGGTG CGTTAAATAG TTGGAATATA
GCTGATCAGC TTGCTATATG GATCAATCAA TTGTTAAAGC TGTTACCAAA TCAGTGGATC
AATATTTCTC TTGCAGAAAG CTATTTTATT TCAGCTATAT TAGCTTTGCT CTTCACCTTA
TGCTGTAAAT GCATTATTAA GTGTTCAACC GACGATGATT TATCTTTATT GAGAAAAGTT
AGAAAAAAAA GTTTTTACCC TATACGGTTG AATTTTTCGG ACGGTTTTTC ACATAAAAAA
TATCAATATG GTATGGTGGT CAGTTCAGCT ATGTCTGTTA TTTTTTTATG TCTGTGGTTT
TTTTCGCTAT ATGAACAAGG GCGATTAAAA AACACACAAT GGCAATTTGA TACTCTAGAT
GTTGGTCAAG GATTGGCAAG TTTATTGGTA AAAAATCAAC ATGGAATTTT GTATGATACG
GGAGCAAGCT GGAAAAATGG TAGTATGGCA AAAATTGAGA TTATTCCTTA TCTAAGAAGA
CAGGGTATTA TTCTTGATAA AGTTATATTA AGTCATGATG ATAATGATCA TGCAGGTGGA
GTAAAAGATA TTTTCCAAGC TTATCCGAAT GCTGAGTTTA TCAGTCCCTC ATTAAAAAAG
TACGAGAATT CTCCAGAAAA TAGACCGCAT ATTGCTTGTC AAAAAGGAAA AATATGGCAT
TGGCAAGGGT TATATATTGA AGCCTTATCG CCAAGTAAAA TTGTGATGAG AGCCAATAAT
CCGGATTCTT GTGTGCTGAT AATTTCAGAT GGACAGCATA AAGTATTATT AACAGGAGAT
GCTGATGTGG CGACTGAATA TAAAATTTTG TCTGACTTGG GTAAGATTGA TGTGTTACAG
GTCGGACATC ATGGTAGTAA AACGTCAACC GGTGAGAAAT TACTACAGCA TATTCAGCCT
AAAATTGCTT TAATTTCCAG TGGACGTTGG AATCCTTGGG GATTTCCGCA TCAAGATGTC
GTCAAACGCT TAAATGCGGT TGAAAGTGCG GTCTATAATA CGGCCATTTC CGGTCAAATT
CGTTTAATAT TCAAAGGAAA AGATATTCAA ATTCAAACCG CAAGGACAGA GTTTAGCCCT
TGGTATAGAG GATTAATTGG CTTGTAA
 
Protein sequence
MKFNLDHYLF VILASAMTLL ITPRIFLLNW QWILILCLVL GLGYIGVKAV FFRCVLKLFF 
IFVLGVGFFH YQALNLLEQS EHITRLPKKV QTNVKIAEII QQKDYQAVIA EGNFTSSLPT
QRIYLNWRTE QEVQVGEIWQ VNMHIRPISS RLNIGGFDRQ TWYLAKEITA YATVKSAVKI
GEDFSWRATR LNQALQQTSN LVSQGLLLAL GFGERAWLDK ELWQLYQQTN TAHLIAISGL
HIGLAMFIGF TLGRAIQLLF PTRYIEPYFP LILGIFLAFC YAELAGFSIP TFRAIVALFI
VCLCRICRVS YNVWQLFLRV ICVLLILNPF MLLSASFWLS IGAVGCLIMW YQWIPLNLFL
WKEKPLAQSS LKKVRYFIGL FHLQFGLLWF FTPIQLLIFN GIALNGFWAN LLLLPLFSFL
LVPLILFAVL TEGALNSWNI ADQLAIWINQ LLKLLPNQWI NISLAESYFI SAILALLFTL
CCKCIIKCST DDDLSLLRKV RKKSFYPIRL NFSDGFSHKK YQYGMVVSSA MSVIFLCLWF
FSLYEQGRLK NTQWQFDTLD VGQGLASLLV KNQHGILYDT GASWKNGSMA KIEIIPYLRR
QGIILDKVIL SHDDNDHAGG VKDIFQAYPN AEFISPSLKK YENSPENRPH IACQKGKIWH
WQGLYIEALS PSKIVMRANN PDSCVLIISD GQHKVLLTGD ADVATEYKIL SDLGKIDVLQ
VGHHGSKTST GEKLLQHIQP KIALISSGRW NPWGFPHQDV VKRLNAVESA VYNTAISGQI
RLIFKGKDIQ IQTARTEFSP WYRGLIGL
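
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction for comparison with the reported GC content. The function names `join_blocks` and `gc_fraction` are hypothetical helpers, and the example runs on the first line of the gene sequence only, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = (
    "ATGAAATTCA ACCTTGATCA TTACCTTTTT "
    "GTTATTTTAG CGAGTGCAAT GACATTGCTA"
)

seq = join_blocks(first_line)
print(len(seq))                     # number of bases on that line
print(round(gc_fraction(seq), 2))   # GC fraction of that line only
```

Pasting the full gene sequence into `join_blocks` in the same way would give a value to compare against the GC content field, and the joined length could be compared against the Gene Length field.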