Gene Ppha_1980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_1980 
Symbol 
ID6463010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp2064594 
End bp2066237 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content46% 
IMG OID642728179 
ProductDNA methylase N-4/N-6 domain protein 
Protein accessionYP_002018809 
Protein GI194337015 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.612228 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAA ATCAAAAATT AGAGCTGACC TGGATAGGAA AGGGTGAAGA GCCGAAGCTT 
GAGCCGAGGA TTCTCATCGA GCAGCCGGAA TACAGTTGTG GTGACCCCGG TAGCGGCAAC
ATGCTGATTC AGGGTGACAA CCTGCTTGCA CTGAAGGCGC TGGAGCAGGA CTATGCGGGG
AAGGTCAAAT GCATCTATAT TGATCCACCG TATAACACAG GCAATGCGTT TGAGCATTAT
GACGACGGGA TTGAGCACAG CCAGTGGTTG AACCTGATGG CACCCCGGCT GAAAATCCTG
CGCGACCTGC TGGCTAATGA TGGTTCAATC TGGATCTCGA TTGATGATGA TGAAAGCCAT
TACCTCAAGG TGCTTTGTGA TGAGATTTTC GGAAGGCGCA ACTTTGTCAA TAATGTGATC
TGGGAGAAAA AATATTCACC TCAAAATGAT GCGAAATGGC TCTCTGACAG CCATGATCAT
ATTCTTGTCT ATGCCAAAAA CAAGGAGATC TGGAGGCCGT ATTTATTGCC GAGAACTGAA
GAAATGGATA AGCGATATAA AAATTATGAT AATGATTTAC GGGGTCTCTG GAAATCAAGT
GATTTATCCG TAAAGACATA TAGCTCGTCA ACTGACTACC CAATACAAAT ACCAAGTGGC
AGAATCGTTA ATCCACCTGC TGGTTATAGT TGGAGAGTTT CAAAAGAAAA ATTCGAGGAA
CTCGTTAAGG ATAATCGAAT ATGGTTTGGA AAAGATGGTA ATAATGTTCC TTCGATAAAG
CGTTTTTTAA GTGATGTTCA AGAAGGATTA GTCTCAAAAA CAATTTGGTA TCGTATAGAA
GTTGGTGATA ACCAGGATGC AAAAAGAGAA GGGAAGCAAT TCAATTCTGA GAATGTTTTT
GCTACGCCAA AACCAGAAAA GCTTGTTTAT CGGATAATGG CACTTGCCTC CCGAGAAGGA
GATCTTGTTC TCGACTCTTT CCTTGGCTCC GGCACAACTG CAGCCGTAGT GCATAAAATG
GGCCGAAAAT GGATCGGTAT TGAGCTTGGT GAACATGCAA AAACGCATTG TTATTCTCGC
CTGAAGCAGG TAGTTGATGG TACCGACCAG GGTGGTATCA GCAAAGCTGT TGAGTGGCAG
GGTGGCGGCG GCTTCAGGTT CTATACGCTT GCCCCTTCTC TCCTGAACAG GGACAAATAC
GGGAACTGGA TTATCAGCAA GAAGTACAAT CCCGATATGC TTGCAGCCGC AATGGCAAAG
CAGGAGGGGT TCCGATACCT GCCAGATGAA CATGTCTACT GGAAGCAGGC TCGAAGCAGC
GAAAAGGATT TTCTCTTCAC CACAACCGGC TTTATGACGG TTGAAATGCT TGACGGAATT
CATGAAGAGA TGCAGCCGGA TGAAAGCCTG CTGCTTGCCT GCAAGGCCTA TCAGAAAGAG
TGTGCCCATC GCTACCCGAA TATTTCCATC AAAAAAATTC CGAACATGCT GCTTGGCCGG
TGCGAATTTG GCCGGGAGGA TTACAGCCTG AATATTGTGC AGGTGCCTTA CGATCGTGCG
GAGGAGGAGG TGCCGGATGA ACCGGAGTTG GCTATTGAGA GTGAGGAAGT TGAAGAGTCA
AGACAGACCG ACCTTTTTGA GTGA
 
Protein sequence
MKPNQKLELT WIGKGEEPKL EPRILIEQPE YSCGDPGSGN MLIQGDNLLA LKALEQDYAG 
KVKCIYIDPP YNTGNAFEHY DDGIEHSQWL NLMAPRLKIL RDLLANDGSI WISIDDDESH
YLKVLCDEIF GRRNFVNNVI WEKKYSPQND AKWLSDSHDH ILVYAKNKEI WRPYLLPRTE
EMDKRYKNYD NDLRGLWKSS DLSVKTYSSS TDYPIQIPSG RIVNPPAGYS WRVSKEKFEE
LVKDNRIWFG KDGNNVPSIK RFLSDVQEGL VSKTIWYRIE VGDNQDAKRE GKQFNSENVF
ATPKPEKLVY RIMALASREG DLVLDSFLGS GTTAAVVHKM GRKWIGIELG EHAKTHCYSR
LKQVVDGTDQ GGISKAVEWQ GGGGFRFYTL APSLLNRDKY GNWIISKKYN PDMLAAAMAK
QEGFRYLPDE HVYWKQARSS EKDFLFTTTG FMTVEMLDGI HEEMQPDESL LLACKAYQKE
CAHRYPNISI KKIPNMLLGR CEFGREDYSL NIVQVPYDRA EEEVPDEPEL AIESEEVEES
RQTDLFE