Gene Haur_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3043 
Symbol 
ID5734915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3843996 
End bp3845123 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content48% 
IMG OID641280187 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_001545809 
Protein GI159899562 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00264807 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGATT TAACCTTGAA CCACGATCAA ATCCTTTTAG GTGATTGTCG TGATGTTCTT 
CCATTACTGC CACCAGCCAG CGTTGACCTC ATTTTCGCTG ACCCGCCCTA TAATTTGCAA
TTGCGTGGCG ATTTATTACG CCCCAACATG ACGCATGTTG ATGCAGTTGA TGATGATTGG
GACTCATTTC GTGATTTTGC TGCCTATGAT GCGTTTACAC GTGCTTGGCT GCAAGCATGT
CAGCGTGTTT TAAAAGATAA TGGCACAATG TGGGTGATTG GTAGTTATCA CAATATCTAT
CGGGTTGGTA CAATTTTACA AGACCTTGGC TTTTGGATTT TAAACGATAT TGTTTGGATT
AAGCGTAATC CAATGCCAAA TTTTCGTGGT GTACGCCTAA CCAATGCCCA TGAGACATTA
ATTTGGTGTG CGAAATTGCC AGGCCAGAAG TATACCTTTA ATTATCATGC CTTGCGCCAT
TTGAACGACG ATAAGCAAAT GCGCAGCGAT TGGGAATTTC CGCTGTGTAC TGGCAACGAA
CGCCTGCGGA TCAACGGCAA CAAAGTGCAT AGCACCCAAA AGCCCGAAGC GTTGCTCTAT
CGAGTATTAT TGGCAAGCTC GAATGTTGGT GATGTGGTGC TTGATCCATT TTTTGGCACG
GGCACGACGG GCGCGGTAGC CAAACGTTTG GCGCGTCACT ACATTGGCAT CGAGCGTGAT
CCCAGCTATG TTGAAGCAGC GCGAGGCCGG ATTGCCGCGA TTGAGTCGCC TAGCAGCACC
GATGCCCTGC AAGCCTTGCC AAGCAACAAA CGGCGGATTC CACGGATTCC GTTTGGCAAT
TTGTTGGAGC ATGGCTTGTT GCAAGCGGGC CAACAATTAT GGTTTAACCG CGATCCAAAC
TTGGTTGCCA CGCTGTTGGC TGATGCTTCG CTGCGCATGT CCGATGGCAC ACGCGGATCG
ATTCACAAGC TTGGTACAAT TTTGACAGGC CAACCAAGTT GCAATGGCTG GGAACATTGG
TTTTTTCAGG CGAGTGATGG TACTTTAACT TCGATTGATG TGTTGCGCCA AGAGGTGCGG
CGTTTACGCG AACAAACTCC AAGCGCCGAT GATTTAAGTG AGTTATGA
 
Protein sequence
MADLTLNHDQ ILLGDCRDVL PLLPPASVDL IFADPPYNLQ LRGDLLRPNM THVDAVDDDW 
DSFRDFAAYD AFTRAWLQAC QRVLKDNGTM WVIGSYHNIY RVGTILQDLG FWILNDIVWI
KRNPMPNFRG VRLTNAHETL IWCAKLPGQK YTFNYHALRH LNDDKQMRSD WEFPLCTGNE
RLRINGNKVH STQKPEALLY RVLLASSNVG DVVLDPFFGT GTTGAVAKRL ARHYIGIERD
PSYVEAARGR IAAIESPSST DALQALPSNK RRIPRIPFGN LLEHGLLQAG QQLWFNRDPN
LVATLLADAS LRMSDGTRGS IHKLGTILTG QPSCNGWEHW FFQASDGTLT SIDVLRQEVR
RLREQTPSAD DLSEL