Gene Ppha_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_0789 
Symbol 
ID6461289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp826624 
End bp829011 
Gene Length2388 bp 
Protein Length795 aa 
Translation table11 
GC content49% 
IMG OID642727044 
Producttype III restriction protein res subunit 
Protein accessionYP_002017698 
Protein GI194335904 
COG category[S] Function unknown 
COG ID[COG4951] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.891259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAGG AGAGCCGGTA TAACCATGTT TCCGGGCAAT TGCCTGATGC ATTAAGGCAA 
CTACAGGAAG AAAATGTCAG GTTGAAAGCT CTGCTTGATG CAAATGGCAT ACCGTGGGAA
GAGTCAGTTC CATCAGAAGA AAATTCATCT GAAATCCCAT TGACATCGAC AGTGCAGCGG
TCAACTATGG AAAAAGTTGC ACTGTTCGGC CAACTATTCC GTGGCCGCAG GGATGTGTAT
CCAATTCGTT GGGAATCTGC CAAAGGAATG TCGGGTTATT CTCCTGCGTG CAGCAACGAG
TGGCGGAAAG GTATCTGCAA TAAACCCCGG ATCAAATGTG GTGATTGCAA GCAGCGCTCG
CTGTTACCGG TTACAGAAAA TGTTCTTTAC CGCCATCTTA CCGGCAAGCA TACCATCGGC
GTATATCCAA TGCTGCCGGA TGAAACCTGT TATTTTATTG CGACTGACTT TGATGATGAG
GGATGGCATG AGGATGCATC AGCCTTCATG CAATCGTGCA GGGAGTTTAA TATTCCTGCT
ACTTTGGAGA TTTCGCGTTC AGGTAACGGT GCGCATGTAT GGATTTTTTT TGCTGAACCT
GTTCCGGCGT CAAATGCACG GCAGCTTGGT GCTGCACTTA TAAGCCAGAC ATGTGACCGT
ACCCGTCAAC TTGCCATAAA AAGCTACGAT CGGTTTTTTC CTAATCAGGA CACCCTCCCA
AAAGGTGGTT TCGGCAACCT GATTGCGCTC CCGTTGCAGC AAAAGCCACG ATCGAAGGGG
TTCAGTGTAT TTGTTGATGA AAACTTTGTC ATCTATCCTG ACCAGTGGGA ATATCTCTCA
TCCATTCAAA GACTATCACG TCTCGAACTG GATTCAGCCA TACAGCGATC AGTTGGCGTT
CGCCATCCGC TTGATATCGC TTTTATTACA GAAGAGGAGG AACAAAAGCC ATGGCAACGC
TCATTGCAGA GTTTCAGTCG GATCCCCGGC CCCATGCCTG AATCACTGAC TCTTGTGTTC
GCTAATCAGA TCTTCATTGC CAAAGCCGAT CTGCCCCAGC CATTGGCCAA CCGTCTCATT
CGCCTGGCTG CTTTTCAAAA TCCCGAGTTT TACAAGGCTC AGGCAATGCG TCTGCCCGTA
TGGGACAAAT CCCGAATCGT CTGTTGTGCC GAAAACTTTC CGCTTCATAT CAGTTTGCCG
AGGGGATGCT TTGACGCAGT CACCGAACTG CTGCGACAGC ACAATATCCG TATGGACATT
CAGGATGAGC GCATATCAGG CACCGAAATA TCAGTAACGT TTACCGGTCA GTTAAGAAAA
GACCAGCAAA CGGCAATCAC TGCGATGCTC AAGCACCAAA CAGGGATCCT CTCCGCACCC
ACAGCATTCG GCAAGACCGT CATGGCTGCC GCAATAATTG CCCGGCGCAA AGTGAGCACC
CTCATTCTTG TTCATCGAGC TGAACTTCTT CAGCAATGGA GGGAGCGGTT ATCCACCTTT
CTCGACCTGT CCGGTGCCTC TCCTGGTTTT ATTGGAAGCG GCAAAAAGAA ACTATCAGGC
TTGATTGATA TCGCTGTCAT GCAATCTCTT TCACGGCGTG ATGATCTTGT CGTGTTGCTC
GACAGCTATG GCCATATCAT TGTTGACGAG TGTCATCATC TTTCAGCCTT TACTTTTGAG
GCAATTCTCA AGCAGTCTAA AGCGGCATAT GTTCTGGGAC TTACAGCAAC ACCTATTCGT
CGAGACGGTC ATCAGCCCAT AATCTTTATG CAATGCGGCC CGGTAAGGCA TCGCGCGCTT
CAAGCAGAAA ATGCTCCAGT TCGCTTAGAG GTTCGGCCGC TAAACCTATT CTCGCCAGTA
ATTCCCCAAG GATCTGGTAT TCAGGATGTT TTCCGTATCC TCACCCATGA CTCCTCCCGC
AACCAATGCA TTGCAAAGGA TATTCTGGCT GCATATCACG AAGGTCGAAA GATCCTTGTG
TTGACGGAAC GCACAGAACA ACTGGAACTG ATACGCGAAG TACTTGCTGA TCAAATTCCG
CATAGCTTTT TGCTGCATGG CCGGTTGACA AAAAAGCAGC GCGCAACTAC GCTTGCAGGT
CTTGCAGAAC TTGATGATTC GGTTCCACGG GTAATTCTGG CAACCGGGCG CCTGATTGGT
GAAGGTTTTG ATCATCCCCC TCTTGATACC ATGGTGCTTG CCATGCCTGT TTCATGGCAT
GGAACATTAC AGCAGTACGC AGGTCGTCTG CACCGTGAAC ATGTCAATAA GGGTGATGTT
CGTATCTACG ATTATGTCGA ACATGACAAT CCGCAGCTTG CACGAATGTG GGAAAAACGC
CAGCGTGGTT ATCGAGCAAT GGGGTATCGT ATCAGTATGC GGGAGTGA
 
Protein sequence
MDEESRYNHV SGQLPDALRQ LQEENVRLKA LLDANGIPWE ESVPSEENSS EIPLTSTVQR 
STMEKVALFG QLFRGRRDVY PIRWESAKGM SGYSPACSNE WRKGICNKPR IKCGDCKQRS
LLPVTENVLY RHLTGKHTIG VYPMLPDETC YFIATDFDDE GWHEDASAFM QSCREFNIPA
TLEISRSGNG AHVWIFFAEP VPASNARQLG AALISQTCDR TRQLAIKSYD RFFPNQDTLP
KGGFGNLIAL PLQQKPRSKG FSVFVDENFV IYPDQWEYLS SIQRLSRLEL DSAIQRSVGV
RHPLDIAFIT EEEEQKPWQR SLQSFSRIPG PMPESLTLVF ANQIFIAKAD LPQPLANRLI
RLAAFQNPEF YKAQAMRLPV WDKSRIVCCA ENFPLHISLP RGCFDAVTEL LRQHNIRMDI
QDERISGTEI SVTFTGQLRK DQQTAITAML KHQTGILSAP TAFGKTVMAA AIIARRKVST
LILVHRAELL QQWRERLSTF LDLSGASPGF IGSGKKKLSG LIDIAVMQSL SRRDDLVVLL
DSYGHIIVDE CHHLSAFTFE AILKQSKAAY VLGLTATPIR RDGHQPIIFM QCGPVRHRAL
QAENAPVRLE VRPLNLFSPV IPQGSGIQDV FRILTHDSSR NQCIAKDILA AYHEGRKILV
LTERTEQLEL IREVLADQIP HSFLLHGRLT KKQRATTLAG LAELDDSVPR VILATGRLIG
EGFDHPPLDT MVLAMPVSWH GTLQQYAGRL HREHVNKGDV RIYDYVEHDN PQLARMWEKR
QRGYRAMGYR ISMRE