Gene Hhal_0419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0419 
Symbol 
ID4711255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp487062 
End bp488156 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content65% 
IMG OID639854877 
Producttransposase, IS4 family protein 
Protein accessionYP_001002010 
Protein GI121997223 
COG category[L] Replication, recombination and repair 
COG ID[COG3666] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGGTG AGGATGTCGA GCAACAGCAC ATGTTCAGCT ACGTCAGCCT CGAAGATCGC 
ATCCCCAAGA ATCACCCTCT TCGCGAGATC CGTGTCCTCG TTGATCGGCT GCTCGACTCG
ATCTCGGACG AGCTGGACGC GGTGTACTCG GGCACCGGTC GGCCGTCGAT CCCGCCGGAG
CGACTGATCC GGGCGCTTTT GCTTCAGGCC CTGTACTCGA TCCGCTCGGG GCGGCTGCTC
ATGGAGCAGC TCGACTGCAA CCTCCTGTTC CGCTGGTTCG TCGGGCTCCA GGTCGATGAC
CCCGTCTGGC ACCCGACCAC CTTCACCAAG AGCCGGGACC GCCTGATCGA GGCGGAGGTT
GCGCAGCGCC TGCTGCGCTC GTTCACGGAA CTCGAGGAGG TTCGGCCCCT GCTCTCCGAC
GAGCACTTCT CGGTCGATGG CACGCTCATC GAGGCCTGGG CCTCGATGAA GAGCTTCAAG
CCGATCAGCC AGGACGGCGA CAGCTCGGGT GATGATGACT TGCAGAGCGG CGGGCGCAAT
CCGACGGTCA ACTTCCGGGG TCAGCAGCGG CGCAACGATA CCCACGCCTC GAGCACGGAC
CCGAATGCTC GGCTCTACCG CAAGGGCCAG GGGCAGCCTG CGCGGCTTTG CTATATCGGC
CATGCCCTGA TGGAGAACCG TCATGGGCTC ATCGTCGATA CCCGCCTAAC CCAGGCCTGT
GGCACCGCCG AGCGGGAGGC GGCCCTGGAG ATGATCCGCG AGATCCCTGC AGAGCGAGGC
CGGCTGACGC TTGCTGGCGA CGCCGGCTAC AACACCCGAG ATTTCGTCCA GGCCCTGCGC
GAGTACGAGG TTACGCCCCA TGTTGCCGAG AAGCGCCGGT TCAATGCTGT GGATGGTCGG
ACGACGCGCC ACCCCGGCTA TGCGGTCAGC CAGCGCATCC GCAAGCGCGT CGAGGAGTTC
TTCGGCTGGT CCAAGACCGT CGGTGGCCTT CGCAAGACCC GCTTCATCGG ACCAGACAGG
GTGGGGTGGG ACTTCGGGTT CCATGCCCTG GCCTACAACC TGGTCCGCAC ACCCAAACTG
CTCGGGGCTG GCTGA
 
Protein sequence
MRGEDVEQQH MFSYVSLEDR IPKNHPLREI RVLVDRLLDS ISDELDAVYS GTGRPSIPPE 
RLIRALLLQA LYSIRSGRLL MEQLDCNLLF RWFVGLQVDD PVWHPTTFTK SRDRLIEAEV
AQRLLRSFTE LEEVRPLLSD EHFSVDGTLI EAWASMKSFK PISQDGDSSG DDDLQSGGRN
PTVNFRGQQR RNDTHASSTD PNARLYRKGQ GQPARLCYIG HALMENRHGL IVDTRLTQAC
GTAEREAALE MIREIPAERG RLTLAGDAGY NTRDFVQALR EYEVTPHVAE KRRFNAVDGR
TTRHPGYAVS QRIRKRVEEF FGWSKTVGGL RKTRFIGPDR VGWDFGFHAL AYNLVRTPKL
LGAG