Gene Hhal_1303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1303 
Symbol 
ID4710795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1413395 
End bp1414927 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content59% 
IMG OID639855772 
Productintegrase catalytic subunit 
Protein accessionYP_001002874 
Protein GI121998087 
COG category[L] Replication, recombination and repair 
COG ID[COG2801] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACACGT ACGAGCAGCG GATCAAGGCA GTAGAGCTCT ACATCCAGTA CGACAAGAAC 
GCGGCAGTTA CCGTCCGGGA GCTCGGGTAT CCGTCGAAGA AGAACCTGCG GCGCTGGTAT
GACATTTACG CAGCAACCGG TGATTTGCCG AAGCGATCGA AGCGCAAACC AAGGTATTCG
GCGGAACAGA AGCAAAGGGC CGTTGACCAC TACATGACCC ATGGCCGTTG TCTAGCGAGG
ACTCGCAAGG CGCTAGGTTA TCCTGGTGTC GAAACGCTAA GCCATTGGGT TCTGGAGCGT
GAGCCCGACT TGCGTACCGG ATCGAGCGCC AGCTTAACGA AGCCTCCTTC GTCGGATGAG
ACTAAGCGGG AGGCCGTCAT TGAGCTATGT TCCCGGCAAG GGGCGGCTTC AGAGGTTGCC
GAGAGGGTGG GCGTCAGCAG GCAGGTGCTG TACAAGTGGA AGGACCGCTT ACTCGGTGCC
GAGGCGCGCC CCCCAATGAA ACCGCGTGAC GATACGACCT CGCAAAGCGA ACGTGGCGCC
CTGGAGCAAG AGATCGAAAC GCTGCAACGG CGCGTCCACC GCCTTCAGCT TGAACACGAT
CTGCTAACAA AAGCGAACGA ACTGTTAAAA AAAGACCACG GCGTCAACCT GCAGCTCCTG
ACGAACAAGG AGAAGACCCT GCTGGTTGAC GCCCTCCGAA ACATCTACTC GCTCACGGAG
CTGTTCACGC CGCTGCGCCT AGCCCGTAGC AGCTACTTCT ACCATCGGGC TCGGCTGCGA
CGGCCGGAGA AGTACAGCGC CCTTCGTGGC CTTGTCAGTA GTCTGTTTGA GGACAACCAC
TACTGTTACG GCTACCGACG CATCCGCGTC GAACTCCACC GTCTTGGCAT CGTGATCTCC
GAGAAGGTGA TCCGGCGGCT CATGGCGGAG GAGCAGCTCG TTGTCCAGAC GACCAAGTGC
CGTCGTTTTA GCTCGTACCG CGGCGAAATC ACTCCGGCCC CTGAGAACGT GGTTAACCGG
GACTTCAGTG CACCGGCGCC TAACCGCAAG TGGCTGACCG ATCTCACCGA GTTTCAGATT
CCGGCCGGCA AGGTCTACCT ATCCCCGTTA ATCGACTGCT TCGACGGGCT GGCGGTCAGT
TGGACGGTGG GAACCCGGCC TGATGCTGAG CTGGTGAACA CGATGCTCGA TTACGCCGTT
GCGGTGCTTA AAGACGATGA GAAGCCGGTG GTTCATAGCG ATCGCGGGGC GCACTATCGT
TGGCCTGGGT GGTTATCCCG CATTGAGTTA GCCGGCCTGA CCCGGTCGAT GTCACGCAAA
GGCTGTACGC CAGACAACGC CGCCTGTGAA GGCTTCTTTG GACGCCTGAA GGCCGAGTTC
TTCTATACCC GTGACTGGCA CGGCGTGACC CTTGAGCAGT TCATCGAGAA ACTCGATGCC
TATCTGCAAT GGTACAACCG AAAACGCGTT AAGCTGTCGC TAGGAGGCCG GAGCCCTCTT
GAGTACCGAG ACAGCCTCGG AATTGCAGCA TGA
 
Protein sequence
MYTYEQRIKA VELYIQYDKN AAVTVRELGY PSKKNLRRWY DIYAATGDLP KRSKRKPRYS 
AEQKQRAVDH YMTHGRCLAR TRKALGYPGV ETLSHWVLER EPDLRTGSSA SLTKPPSSDE
TKREAVIELC SRQGAASEVA ERVGVSRQVL YKWKDRLLGA EARPPMKPRD DTTSQSERGA
LEQEIETLQR RVHRLQLEHD LLTKANELLK KDHGVNLQLL TNKEKTLLVD ALRNIYSLTE
LFTPLRLARS SYFYHRARLR RPEKYSALRG LVSSLFEDNH YCYGYRRIRV ELHRLGIVIS
EKVIRRLMAE EQLVVQTTKC RRFSSYRGEI TPAPENVVNR DFSAPAPNRK WLTDLTEFQI
PAGKVYLSPL IDCFDGLAVS WTVGTRPDAE LVNTMLDYAV AVLKDDEKPV VHSDRGAHYR
WPGWLSRIEL AGLTRSMSRK GCTPDNAACE GFFGRLKAEF FYTRDWHGVT LEQFIEKLDA
YLQWYNRKRV KLSLGGRSPL EYRDSLGIAA