Gene Hhal_1155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1155 
Symbol 
ID4710145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1258837 
End bp1260816 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content65% 
IMG OID639855629 
ProductN-6 DNA methylase 
Protein accessionYP_001002733 
Protein GI121997946 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACACCG AGAATCACTC CCAGATGGCC GGATTCATCT GGTCGGTCGC CGACCTGCTG 
CGCGGCGATC TCAAGCAATC CCAGTACGGA CGGGTCATCT TGCCGTTTAC CCTGCTGCGG
CGGCTGGAGT GCGTCCTGGA GCCGACCAAG GAGCAGGTGC TGGCCGCGGC GAAGGAGCAC
GCGGACAAGC CGCTGGGGGT GCGCGAGCGG CTTCTGCGCC GGGCGGCCGA TCAGCCTTTC
TTCAACACCT CGCCGCTGAC CCTGGGGACG CTGTCGGACA CGCAGACCGC GGACGACCTG
ATGAGCTACG TCCAGTCGTT CAGCCCCGAT GCCCGAGAGG TCTTTGAGCA CTTCAACTTC
GAGGACTTCG TCCAGCAGCT CTCGGCGAAC AATCTGCTCT ACCAGGTGGT GCAGCGCTTC
GCGGCCATGG ATCTCAGCCC CGGGCGGATC TCCAACTTCG GCATGGGCTC GATCTTCGAG
GAGCTGATCC GCAAGTTCGC CGAGAGCTCC AACGAGACCG CCGGTGAGCA CTTCACGCCC
CGCGACGTGG TCCACCTGAC CACCTCGCTG GTGCTCACCG ATCAGGACGA CAAGCTGCAA
CCGCACAGCG TGGTCACGGT CTATGACCCG GCCGCCGGCA CGGGTGGCTT CCTCTCCGAG
AGTGACGCCT ACATCCAGCA GGTCAGCGAT AACGTGACCG TTTCGCTGCA CGGCCAGGAG
CTCAACCCGG AGTCCTACGC CATCTGCAAG GCGGACATGC TGATCAAGGG CCAGCAGGTC
GAGAACATCA AGCTCGGCAA CACCCTCTCC GACGACGAGC TCGCCGGCGA GCGCTTCGAC
TTCATGCTTG CCAATCCGCC CTTCGGCGTG GAGTGGAAGA AGGTCCAGAA GCAGGTCACC
GACGAGCACA AGCGCTGGGG GTACAACGGC CGCTTCGGAC CGGGCCTGCC CCGGGTCTCC
GACGGCTCCC TGCTATTTCT GCTGCACCTG GTGAGCAAGG TCCGCGATCC GCGGGAGGGT
GGCTCGCGCA TCGGCATCAT CCTCAACGGC TCCCCGCTGT TCACCGGCGG GGCCGGTAGC
GGCGAGTCGG AGATCCGTCG CTTCCTGCTT GAGCGCGACC TGGTGGAGGC CATCGTCGCC
CTGCCCACGG ACATGTTCTA CAACACCGGC ATCGCCACCT ACGTCTGGAT CCTCTCCAAC
GACAAGCCGC CGGAGCGCCG CGGTCGGGTG CAGCTGATCA ACGCCACCGA GCGTTACAGC
AAGATGCGCA AGTCGCTCGG ATCCAAGCGG CAGTACATCG ACGATACAAA CATCGACAAC
ATCGTCCGCC TCTACGGCGC CTTCGAGGAG AGCGAAGAGA GTAAGCTCTT CCCGGTGGCG
GAGTTCGGCT ACCGGCGGAT CACCGTCGAG CGGCCCCTGC GGCTCAACTT CCAGGCCAGC
GAGGAGCGCA TCCGCCGGAT CCTCGACGAG AAGCCGATCC AGAAACTCGA CGAGGACACC
CAGGCCCGCC TCCTGGCCGC CTGCGAGGCC ATGGACGGCC AGATGCTCTA CCGGGACCGG
CAGGCGTTCA CCCGCGACCT GAAGCGTGCC CTGGAGGAGC GGGAAGTGAA GCTCGGCGCG
CCACCGATGA AAGCGGTCCT CAACGCCTTA TCCGAGCGCG ACCCGGAGGC CAAGCCGTGC
ACCGACGCCA AGGGCAACCC GGAGCCGGAC ACCAGCCTGC GTGACCACGA GAACGTGCCG
CTGACCGAAT CCGTCTACGA CTATTTCGAG CGCGAGGTGC GCCCGCACGT CCCCGACGCC
TGGATCGACG AGGCCAAGCG TGACGCCCAG GACGGCGAGG TGGGCATCGT CGGCTATGAG
ATCCCCTTCA ACCGCCACTT CTACAAGTTC ACCCCGCCGC GCCCGCTCGA AGAGATCGAC
GCGGACCTGA AGGTCTGCAC GGACCGGATC AAGCGGATGA TCGAGGAGCT GTCGGCATGA
 
Protein sequence
MNTENHSQMA GFIWSVADLL RGDLKQSQYG RVILPFTLLR RLECVLEPTK EQVLAAAKEH 
ADKPLGVRER LLRRAADQPF FNTSPLTLGT LSDTQTADDL MSYVQSFSPD AREVFEHFNF
EDFVQQLSAN NLLYQVVQRF AAMDLSPGRI SNFGMGSIFE ELIRKFAESS NETAGEHFTP
RDVVHLTTSL VLTDQDDKLQ PHSVVTVYDP AAGTGGFLSE SDAYIQQVSD NVTVSLHGQE
LNPESYAICK ADMLIKGQQV ENIKLGNTLS DDELAGERFD FMLANPPFGV EWKKVQKQVT
DEHKRWGYNG RFGPGLPRVS DGSLLFLLHL VSKVRDPREG GSRIGIILNG SPLFTGGAGS
GESEIRRFLL ERDLVEAIVA LPTDMFYNTG IATYVWILSN DKPPERRGRV QLINATERYS
KMRKSLGSKR QYIDDTNIDN IVRLYGAFEE SEESKLFPVA EFGYRRITVE RPLRLNFQAS
EERIRRILDE KPIQKLDEDT QARLLAACEA MDGQMLYRDR QAFTRDLKRA LEEREVKLGA
PPMKAVLNAL SERDPEAKPC TDAKGNPEPD TSLRDHENVP LTESVYDYFE REVRPHVPDA
WIDEAKRDAQ DGEVGIVGYE IPFNRHFYKF TPPRPLEEID ADLKVCTDRI KRMIEELSA