Gene Haur_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0114 
Symbol 
ID5732007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp146839 
End bp147900 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content51% 
IMG OID641277236 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_001542894 
Protein GI159896647 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCGGT TGCTTTGGTT TCGCAAACCA CGTTTGCTCA ATCTTCTGGC TGGTTGGCGT 
GATCTCACAG CCCTCGTCGC ACTCCGCGCT AGTATTCTTG CTACGACCTC GCACTTAGCC
GATTATGCCG ATGGGATCGC TGATCAGGCT GAGAGTGTCG CGTTTGATTA TGCTTTTTTG
CTTGGTGAGT TGCAGCAAAT TAGCGAGGCC CAGACCCTCG AACGAGCGCA TTACTATATC
GAGCGGCTGG CTCGGAGCAT TGCCACCGTG CGCACCACCG CGATCAACGA TATTAACCTC
AATCGTTGGA AGGAATACGA CGATATTAAC ACTGATAGCC TGTGGATGAT CGATCGCCGC
GATGGCTCGG GAGTGCATTC GGCGGGCTAT TGGGGCAATT TTGTACCGCA AATTCCCAAT
CAGCTGATGC GGCGTTATAC CAAACAGGGC GATTGGGTAA TTGATACCTT TGCAGGCTCA
GGCACAACGT TAATCGAAGC CCAACGCTTG GGTCGCAATG TGCTGGGCGT TGAGTTACAG
CCGCATATGG TCGAGTATGC CAACCAAGCC GTCGAGCGCG AGCCAAATCC GCTGGCGATT
GTGGCGCGTT CAGTCCATGG CGATTGCACT ACAATCAATT GGCAAGCACT TTTAGCAGAT
TATGGTCAGT GCCATGTACA GTTGGCGATT ATGCACCCGC CCTATTTCGA TATTATCAAC
TTCAGCGACG ATGAACGCGA TTTATCCAAT GCACCTTCAG TCGAGGATTT TCTGGGCCAA
ATGGCGGCGG CGGTGGCTCA GGTTAAGCCT GTTTTGCAAC GCGGTCGGCA TCTGGCGGTA
ATTATCGGCG ATAAATATAT GCATGGCGAG TGGGTGGCTC TTGGATTTCG CACCATGGAA
GTTGTACAGC AGCAAGGCTT TCAACTCAAA AGCATCATCG TCAAAAATTT TGAAGATACC
ACTGGCAAGC GCCATCAAAA AGAGCTATGG CGCTATCGCG CCTTGGTCGG CGGCTTTTAT
ATCTTCAAGC ACGAATATAT TTTTCTATTT CGCAAGAAGT AA
 
Protein sequence
MDRLLWFRKP RLLNLLAGWR DLTALVALRA SILATTSHLA DYADGIADQA ESVAFDYAFL 
LGELQQISEA QTLERAHYYI ERLARSIATV RTTAINDINL NRWKEYDDIN TDSLWMIDRR
DGSGVHSAGY WGNFVPQIPN QLMRRYTKQG DWVIDTFAGS GTTLIEAQRL GRNVLGVELQ
PHMVEYANQA VEREPNPLAI VARSVHGDCT TINWQALLAD YGQCHVQLAI MHPPYFDIIN
FSDDERDLSN APSVEDFLGQ MAAAVAQVKP VLQRGRHLAV IIGDKYMHGE WVALGFRTME
VVQQQGFQLK SIIVKNFEDT TGKRHQKELW RYRALVGGFY IFKHEYIFLF RKK