Gene Hhal_0784 details

Gene Information

Locus tag: Hhal_0784
Symbol:
ID: 4709364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 870642
End bp: 872330
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 67%
IMG OID: 639855244
Product: transposase, IS4 family protein
Protein accession: YP_001002363
Protein GI: 121997576
COG category: [L] Replication, recombination and repair
COG ID: [COG5421] Transposase
TIGRFAM ID:
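
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below reproduces that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues in the product, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(870642, 872330)
print(length)                     # 1689
print(protein_length_aa(length))  # 562
```

Both results agree with the Gene Length and Protein Length values listed in the record.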


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATGTAC GCGTCGTCCG ATCCGGTCCC CGCCGCTACG TCCGCCTCGT CGAGGGCTAC 
CGCGACGACA ACGGACGGGT GAAGCAACGC ACCGTCGCGA ACCTCGGGCG GGCCGACCAA
CTCAGCGATG ACGAGGTGGA CGGCCTCATC GAGGGCCTGC AACGCGCTGT TGGGCGGTCT
GAGCCCCGGA GAGCGGAACC GGAGTTTCAG CGTGCTCGCG CCTTCGGAGA CCTTTGGACG
CTCCATCAGC TCTGGCAGGA ACTCGGCCTG GATGACGCCC TTCGCAAGGC GCTGCGCTCG
TCGCGGCGCG AGTTCGACGC GGAGGCGCTG ATCCGCGCCA TGGTCTTCAA CCGGCTGGCT
GCACCGCGGA GCAAGCGCGG CGTGCTCGAC TGGCTTCAGG AGGATGTCAG CCTCCCGGGT
GTGGATTCTG AGCAACTGCA CCACGAGCAG CTGCTGCGCG CCATGGACGC GCTGATGAAC
CACCCCGAGC GGGTCGAGCA GGCCATTGCC GGGCAGCTCA AGCCCCTGCT CGATCAGGAG
CTCAGCGTCG TGTTCTGGGA CATCACCACG GTGCGCATCC ACGGTGTGGA GGAGGTCGCG
GACGATCTGC GCCAGCGCGG CAAGAGCAAG GACACAGGCG GCGTTGCCCG CCAATTCGCC
CTCGGGGTCG TGCAGACCTC CGATGGCCTG CCGATCGCCC ACGAGGTCTT CGAGGGCAAT
GTCGCGGAGA CGCGCACCCT GGCACCCATG CTCCAGCGCC TGCTCAGCCG CTACCCGATC
AGGCGTGTGG CCGTGGTCGC GGATCGGGGG CTGCTCAGCC TCGACAATGT CGACACGCTG
GAGGCGCTCA GCGCGCAGCA CGACCTGGCT ATCGATTACA TCCTGGCGGT GCCCGGACGC
CGGTATGCGG AGTTCACCGG CTTGATGAGC GAGCTGCAGC CGCAGCTCGA GAAGCAGGCG
GCAGAGCATG AGGGCGAAGC CGTCACCGAG ACCACTTGGC AGGGGCGCCG GCTGATTGTG
GCGCACAACC CGCAACGGGC TGCCGAACAG CAGGCCTCGC GCCGGCACAA GATCCACCGG
CTAGAGCAGA TCGGCGAGGC CCTGGCCCAG CGACTCGACA ACCAGGATGC GGGAAAGCCC
GGCCGCGGGC GGCGCTCGAC TGATCGCAGC GCCCATCGGC GCTTCCATCA GCACGTCCTG
CGCAGCAGCC TCTCGAGCAT CGTCAAGGCC GACCTCGGCG CCGAGCAATT CCGCTATGAC
ATCGACGAAG AGGCGTGGGC GGCCGCCGAG CGGCTCGACG GCAAGCTGCT GCTGGTCACG
AGCCTCCACG AGTACTCGCC GAGCGAGATC GCCGACCGCT ATCGGGCCCT TGCGGACATT
GAGCGCGGCT TCCGCGTGCT CAAGAGCGAC ATCCAAATCG CGCCGGTCTA CCACCGGCTG
CCGGAGCGCG TCCGTGCCCA CGCGCTCATC TGCTTCCTGG CCCTGGTTCT GCATCGCATC
CTGCGCCGGC GTCTCAAGGC GGCTGGTAGC GGTTACTCAC CGGAGAACGC GTTACGGGCG
CTGCGCCGTA TACAGCGCCA CAAAGTTCAC CTCGCCGGGC AGGACTACGA GCGGGTGACC
AAGCCGACAC CGGAGCAACT CCAGCTCTTC GACGAGCTAG GTGTAGAGGT GCCTCAGCAC
CCGGCTTGA
 
Protein sequence
MYVRVVRSGP RRYVRLVEGY RDDNGRVKQR TVANLGRADQ LSDDEVDGLI EGLQRAVGRS 
EPRRAEPEFQ RARAFGDLWT LHQLWQELGL DDALRKALRS SRREFDAEAL IRAMVFNRLA
APRSKRGVLD WLQEDVSLPG VDSEQLHHEQ LLRAMDALMN HPERVEQAIA GQLKPLLDQE
LSVVFWDITT VRIHGVEEVA DDLRQRGKSK DTGGVARQFA LGVVQTSDGL PIAHEVFEGN
VAETRTLAPM LQRLLSRYPI RRVAVVADRG LLSLDNVDTL EALSAQHDLA IDYILAVPGR
RYAEFTGLMS ELQPQLEKQA AEHEGEAVTE TTWQGRRLIV AHNPQRAAEQ QASRRHKIHR
LEQIGEALAQ RLDNQDAGKP GRGRRSTDRS AHRRFHQHVL RSSLSSIVKA DLGAEQFRYD
IDEEAWAAAE RLDGKLLLVT SLHEYSPSEI ADRYRALADI ERGFRVLKSD IQIAPVYHRL
PERVRAHALI CFLALVLHRI LRRRLKAAGS GYSPENALRA LRRIQRHKVH LAGQDYERVT
KPTPEQLQLF DELGVEVPQH PA
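
As a rough cross-check of the two sequence blocks, the sketch below translates nucleotide text with the codon assignments of translation table 11 and computes a GC percentage. It is a minimal illustration, not the database's own method. It assumes that table 11 shares its elongation codon assignments with the standard code (the two differ in permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical. Only the first and last rows of the gene sequence are embedded; pasting the full sequence into `gc_content` would allow a comparison with the 67% reported in the record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_row = "ATGTATGTAC GCGTCGTCCG ATCCGGTCCC CGCCGCTACG TCCGCCTCGT CGAGGGCTAC"
last_row = "CCGGCTTGA"

print(translate(first_row))  # MYVRVVRSGPRRYVRLVEGY
print(translate(last_row))   # PA
```

The first output matches the first 20 residues of the protein sequence, and the second matches its final two residues, with the terminal TGA read as the stop codon.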