Gene HS_1474 details

Gene Information

Locus tag: HS_1474
Symbol: recJ
ID: 4240994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1665695
End bp: 1667416
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 39%
IMG OID: 638105055
Product: single-strand DNA-specific exonuclease
Protein accession: YP_719684
Protein GI: 113461615
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID: [TIGR00644] single-stranded-DNA-specific exonuclease RecJ
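
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes one terminal stop codon. The record does not state either convention, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1665695
end_bp = 1667416
gene_length_bp = 1722
protein_length_aa = 573

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein length.
codons = span // 3

assert span == gene_length_bp          # 1722 bp
assert span % 3 == 0                   # a whole number of codons
assert codons - 1 == protein_length_aa # 574 codons, 573 residues plus stop

print(span, codons)
```

Under these assumptions the reported gene length, protein length, and coordinates agree with one another.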


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATAAGT TTATTAAACG ACGTGATGTT GCGATAGATG TTCAAGTTTG TGAACATCCT 
TTATTGGACC GAATTTACCG TGCACGTCAA ATTAAAAATC CTCAACAATT AGACCGCACT
TTGAAATCTA TGTTGGCACC TGATTTGCTA CATGGTATTG GAAGTGCGGT GGATTTATTG
GTAACAGCGA GAGAAAAACA ACAAAAGATT GTTATTGTCG GTGATTTTGA TGTTGACGGA
GCAACCAGCA CAGCACTCAT TGTTTTGGCT CTTCGTCAAC TTGGGTTCAC AGATGTAGAT
TATTTAGTAC CTAATCGCTT TGAGCAAGGT TACGGTTTAA GCGTGGAAGT AGCTAAATTA
GCATTAGAAA AAAATGTTGA GTTATTGATT ACAGTAGATA ATGGGGTATC TTCTTTTGAT
GGTGTTGTGT TTTTAAAAAA TGCGGGAGTG CAAGTGTTGA TTACAGATCA CCACTTACCG
CCGGAAATAT TGCCTCCTGC CCATGCTATT GTTAATCCAA ATCTTACACA ATGTAATTTT
CCTTCAAAAT TTCTTGCTGG TGTCGGTGTC GCTTTTTATG TCATGCTGGC ATTGCGTGCA
AAATTACGTG ATTTAGGGAT TTTTACTACG CAAACCCAAC CGAATTTTAC AGAGTTATTA
GATTTAGTTG CCCTTGGTAC AATTTCCGAT GTGGTACCGC TTGATCAAAA TAATCGTATT
TTAGCTTATC AAGGTTTAGC CAGAATTCGA GCTGAGCGTT GTCGTCCCGG TATTCGTGCT
TTGGCGGAAA TTGCAAATCG TAATATGAGT CGATTGACCG CTTCCGATTT AGGCTTTTCT
ATTGGTCCAA GATTGAATGC TGCCGGGCGT TTGGACAATA TGTCTGTTGG TGTGGAGTTA
TTATTGGCCG ATGATATGCA ACATGCTCGT CAATTAGCTT TTGAATTAGA TAGTTTGAAT
CAGGCTCGTA AAGAAATTGA GCAAGGTATG AAACAGGAAG CGTTAGAAAT TTGTCGGAAT
TTGACCGCAC TTTCTAAGCA ATTACCGCTG GGCATCGTAC TATATCAGGT TGATTGGCAT
CAAGGTGTAC TCGGTATTTT GGCTTCTCGA ATTAAAGATA AATTTCATCG ACCGACCATT
GCGTTTGCTC AGGATCAAGT CGGAATTTTA AAAGGTTCAG CACGATCTAT TGAAGGTTTA
CATATGCGTG ATGTGTTAGA GCGTATTCAT TCTCGCTATC CGAATATGAT TTTAAAATTT
GGTGGTCATG CGATGGCTGC AGGGTTGAGT ATTCGTGAGG AATTGTTTAC ACAATTTCAA
CAAGCCTTTA TTCAAACTGT GACTGAATGG TTAAAAGAAG AACAATTACA AGGTATTATT
TGGACTGATG GTCAATTAGC ACCCGCGTTT TTAAATTTGG AAACAGCGGA ATTACTGCGT
CAAGCCGGTC CTTGGGGGCA AGGTTTTCCA GAGCCGTGTT TTGATGGTGA ATTTACAATT
TTACGCCAGA GTATTGTGGG TGAAAAACAT TTAAAAATGC TTGTTGAGCC TAAACAAGGA
GGTCCGCTGT TGGATGCTAT TGCATTTAAT ATTGATAAGG ATTACTACCC TGATTTTTCG
ATAAAACATG CAAGAATAGC TTATAAATTA GATAGTAATG AGTTTCGTGG AAATCGTCAT
GTTCAGTTGT TGGTAGATTA TATTGAACCT TTAGATCATT AA
 
Protein sequence
MNKFIKRRDV AIDVQVCEHP LLDRIYRARQ IKNPQQLDRT LKSMLAPDLL HGIGSAVDLL 
VTAREKQQKI VIVGDFDVDG ATSTALIVLA LRQLGFTDVD YLVPNRFEQG YGLSVEVAKL
ALEKNVELLI TVDNGVSSFD GVVFLKNAGV QVLITDHHLP PEILPPAHAI VNPNLTQCNF
PSKFLAGVGV AFYVMLALRA KLRDLGIFTT QTQPNFTELL DLVALGTISD VVPLDQNNRI
LAYQGLARIR AERCRPGIRA LAEIANRNMS RLTASDLGFS IGPRLNAAGR LDNMSVGVEL
LLADDMQHAR QLAFELDSLN QARKEIEQGM KQEALEICRN LTALSKQLPL GIVLYQVDWH
QGVLGILASR IKDKFHRPTI AFAQDQVGIL KGSARSIEGL HMRDVLERIH SRYPNMILKF
GGHAMAAGLS IREELFTQFQ QAFIQTVTEW LKEEQLQGII WTDGQLAPAF LNLETAELLR
QAGPWGQGFP EPCFDGEFTI LRQSIVGEKH LKMLVEPKQG GPLLDAIAFN IDKDYYPDFS
IKHARIAYKL DSNEFRGNRH VQLLVDYIEP LDH
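
Both sequences above are printed in space-separated groups of ten characters, so they need whitespace removed before any length or composition check. The following is a minimal sketch of that cleanup. The helper names `clean` and `gc_fraction` are hypothetical and not part of any database tool, and only the first row of the gene sequence is used as sample input.

```python
def clean(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence from this record: six groups of ten bases.
first_row = "GTGAATAAGT TTATTAAACG ACGTGATGTT GCGATAGATG TTCAAGTTTG TGAACATCCT"

print(len(clean(first_row)))  # 60
print(gc_fraction(first_row))
```

Applied to the full gene sequence block, `len(clean(...))` should match the Gene Length field and `gc_fraction(...)` should come out close to the reported GC content, provided the block was copied without loss.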