Gene Hhal_0414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0414 
Symbol 
ID4711541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp481678 
End bp482988 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content62% 
IMG OID639854873 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_001002006 
Protein GI121997219 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.596456 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACTCTT CCAATGCTGC GACCGCACTG CTTTACTCGG CCTTGGAGCT GGCCACGAGG 
GCGGATGTGG CCCGTGAACT CGGGGTGGAC GAACGTACGG TCCGGCGGTG GGCGAAAGGC
GAGATCCCGA TGCCCAATCG TTGGGAGCCG GCGCTGAGCC AGCTCCTGCT CCGGCGGGCT
CCCCTGCGGT CTGACGTTGA CGGGCAGTTC TCCTTCATTG ACCTGTTCGC CGGAGTGGGG
GGTATCCGAC AGGGGTTTGA AAGCGTCGGT GGCCACTGCG TCTTTTCCTC CGAATGGGAC
CGTTTCGCCC TCCAGACCTA CCGTGCAAAC TTTGGGAACG AGGGCGAAGA GATCCAGACG
GACATCCGTC AGATTACGGC GGTCTCAGAT GACGCGGATG AGAACAGCCG CTCTATCGAC
GAACGCATTC CTCAGCACGA CGTCCTGTTG GCCGGATTCC CGTGTCAGCC CTTTTCGCTG
GCCGGCGTTT CCAAGAAGAA CAGCCTGGGA CGCGAGCATG GGTTCCTGTG CGAGGCGCAG
GGAACCCTGT TCTTTGACGT TGCGCGGATC ATCGAGGTAA AGCGTCCACG GGCATTCCTC
CTCGAGAACG TGAAAAACCT GCGAAGTCAT GATGGCGGAC GTACCTACGA AGTGATCCGC
CGCGTGCTGG AAGAGCTCGG TTATCGCGTG CATGATCGGG TCATCGACGG CAAGGGGTTC
GTGCCCCAGC ACCGCGAGCG GATCTACATG GTTGGCTTCC GTAAGGATAC GCCCTTCACC
TGGAACCAAC TGGACTTCCC CGCACCGGAC GCCCGCACCC TCCGGGAGGT CCTGCACCCG
GAGGACGGAT CGGAAGCCGC GGAACCCCCT TATACCGAGG GTGACTTGGC CACCGTAGGC
GACAAGTACG TCCTGAGCGA GAAACTCTGG AAATACCTGC AGGACTATCG GGCCAAGCAT
GAGCACGCGG GCAACGGCTT CGGCTACAGT AAGGTCGGCC CGGAAGACAC CGCACGGACA
CTGTCCGCCC GGTACCACAA GGACGGCTCC GAGATCCTGG TTGACCGGGG AGCTGGCGAG
CGGCCGCGTC GGCTTACGCC ACGCGAGTGC GCCCGACTGA TGGGCTTCGA TGACAGTTTC
CGGATCCCGG TGAGTGACAC GCAGGCTTAT CGCCAGTTTG GTAACTCCGT TGTCGTTCCG
GTCATCCGTG AGATCGCGTC TGCGATGGCG CCACACGTCC TCGCGGACAT CCGCTCCGAC
CAGGATGGCC ATCAGCTGGC GCTGCCGATG GAGTTCAGGG AGACGGCGTG A
 
Protein sequence
MHSSNAATAL LYSALELATR ADVARELGVD ERTVRRWAKG EIPMPNRWEP ALSQLLLRRA 
PLRSDVDGQF SFIDLFAGVG GIRQGFESVG GHCVFSSEWD RFALQTYRAN FGNEGEEIQT
DIRQITAVSD DADENSRSID ERIPQHDVLL AGFPCQPFSL AGVSKKNSLG REHGFLCEAQ
GTLFFDVARI IEVKRPRAFL LENVKNLRSH DGGRTYEVIR RVLEELGYRV HDRVIDGKGF
VPQHRERIYM VGFRKDTPFT WNQLDFPAPD ARTLREVLHP EDGSEAAEPP YTEGDLATVG
DKYVLSEKLW KYLQDYRAKH EHAGNGFGYS KVGPEDTART LSARYHKDGS EILVDRGAGE
RPRRLTPREC ARLMGFDDSF RIPVSDTQAY RQFGNSVVVP VIREIASAMA PHVLADIRSD
QDGHQLALPM EFRETA