Gene Hhal_1480 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1480 
Symbol 
ID4709151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1601481 
End bp1603151 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content69% 
IMG OID639855947 
ProductDNA repair protein RecN 
Protein accessionYP_001003049 
Protein GI121998262 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.269893 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCAGGG AAATCCACAT CCGCGATTTC GCCATCGTCG AGCGACTCGA CCTGGAGTTC 
GGCGAGGGTA TGCATGTGCT CACCGGCGAG ACGGGCGCCG GCAAGTCGAT CCTCCTCGAC
GCCCTCGGCC TGTGCCTGGG CGACCGGGCC AAGGCGGAGA TCGTCCGCCC CGGCACCGAC
CGCGCCGAGG TCAGCGCCGT GTTCGATCCC CCGTCCACGC GGATTGCCCG CTGGCTGGCC
GAGCGCGAGC TTGAGGGCGA GGATGAAGTC ATTGTCCGGC GCGTCATCCA GAGCAACGGA
CGCTCCCGCG GGTTCATCAA CGGCACGCCG GTTGCGCTGC AGATGCTCCG TGAGCTCGGT
GAGCAGCTGG TCGACATCCA CGGCCAGCAC GCCCACCAGT CGTTGCTCCG CCCCGCTACT
CAGCGAGAAC TGCTCGACGC CTACGCCGAG GCCACCGAGG CCCGCCGCGA GGTCGCCGAG
CGCTTCCGCG AGCTGCGCGA TCTCGACCGT GAACTCACCG ACCTTGAGGG CCAGGACAAC
GACTACGAAG ACCGCCTCGC CCTGCTGCGC CATCAGGTCG ATGAACTGGA GGCGGCCGCC
CCTTCCCCGG ACGGCCTGAC AGAGCTGGAG AACGAGCACC AACGGCTGGC CCACGCCGAA
TCCCTGATCA CGCTGGCCCA GACCCAGCTC CAGGCCCTAT CCGATGACGA CCACGCAGCC
CAGGCCCTGC TCGGCCGGGC CGTCCGCGAA CTTGAAGAAC GGCGTGACCT CGCCCCGGCT
CTGGGCGAGG CGGCGGATCT GTTCCAAAAC GCCCTGGCCC ATCTCGAAGA GGGCTGTCAG
ACCCTGCGCG CCTTCGCCGA CGGGCTTGAG ATCGACCCGC AGCGGCTGGC TGCGCTCGAC
CAACGGATCA GTGACCTACG GGATCTTGCG CGCAAGCATC GGGTTGAGGT CGAGCAGCTG
CCCGAGACCC TCGAAGCGTT ACAGGCCCAA CTGGAACGCC TGGAAAACGC CGGTCAGCGT
CTGGAGATCC TGCGCCAGGA ACGTAGGGCG GCCGTCGAGC GCTACCAGGA GGCGGCCCGG
CAACTCTCCC GGCAGCGTCA GGAGGCGGGG GATCGCCTGG CCACCGAGGT CAATCAGCTG
CTCTCGGAGC TCGGCATGGC GGGCGCCGCA CTGATCCCGG TGATCGAGTT CGAGGCGGAG
GCCACCCCGA GCGGGCACGG CCTGGATCGG ATCGAACTCC AGGTCCGAAC GAATGCCGGC
CAAGCCGCCG GCCCCCTCGC CAAGGTCGCC TCAGGAGGCG AGCTCTCTCG CCTCGGTTTG
GCCATTCAGG TCGCCACGGT CAACCGCGCC AGCGGGGTCC CGACGCTGGT CTTCGATGAG
GCCGACACCG GCATCGGCGG CGCCGTCGCC GAGGTCGTCG GGCGGATGCT GCGCACACTG
GGCCAGCGCT ACCAAGTCCT GTGCGTAACT CACCTGCCTC AGGTGGCTGC CCAGGGCGGG
CACCACTTCC TGGTCCGCAA GTCCGAGGCA GAGGGCAGTA CCCGCACTGA GGTCGACCCC
CTCTCAACCA CCCAGCGGAT CGAGGAGATC GCCCGCATGC TCGGTGGCCT GGAGATCACC
GATCACGAGC GCGCCGCGGC CGAGCGGATG CTCGAGCGCG GCGGCAGCTA G
 
Protein sequence
MLREIHIRDF AIVERLDLEF GEGMHVLTGE TGAGKSILLD ALGLCLGDRA KAEIVRPGTD 
RAEVSAVFDP PSTRIARWLA ERELEGEDEV IVRRVIQSNG RSRGFINGTP VALQMLRELG
EQLVDIHGQH AHQSLLRPAT QRELLDAYAE ATEARREVAE RFRELRDLDR ELTDLEGQDN
DYEDRLALLR HQVDELEAAA PSPDGLTELE NEHQRLAHAE SLITLAQTQL QALSDDDHAA
QALLGRAVRE LEERRDLAPA LGEAADLFQN ALAHLEEGCQ TLRAFADGLE IDPQRLAALD
QRISDLRDLA RKHRVEVEQL PETLEALQAQ LERLENAGQR LEILRQERRA AVERYQEAAR
QLSRQRQEAG DRLATEVNQL LSELGMAGAA LIPVIEFEAE ATPSGHGLDR IELQVRTNAG
QAAGPLAKVA SGGELSRLGL AIQVATVNRA SGVPTLVFDE ADTGIGGAVA EVVGRMLRTL
GQRYQVLCVT HLPQVAAQGG HHFLVRKSEA EGSTRTEVDP LSTTQRIEEI ARMLGGLEIT
DHERAAAERM LERGGS