Gene Ppha_2065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_2065 
Symbol 
ID6462820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp2155030 
End bp2156427 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content47% 
IMG OID642728259 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_002018889 
Protein GI194337095 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.145613 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTGC CTGATTTTTT TGACGATAAT CGCCATAACC CTGCGGGAGC ATCAGGAAGA 
GAACAGCCCG ACCTTGATGG TCTTGATTCC ATGTTTGATT CGGAGGATCT TCTTGACCGA
ATCAGTCAAT ATAATGAAGA GGGGTTTCAG CTTGAAGCAC TTGCTGTAGC CCGTCGTCTT
GAAGAGATAG CGCCTTTTAA TACGGAAACC TGGTTCCATC TGGGTAATTG CCTGACGCTG
AACGGTTTTT TTGATGATGC ACTTGAAGCG TTTCAGCGGG CGGCCGTCTT CAGTCCCATG
GACAGCGAAA TGCGGCTGAA TCTCGCTCTT GCCTGGTTCA ACACCGGTAA TCATGATGAA
GCGCTCAGAG AACTCGAAGA TATTCTTGTT GACTCCTCCA TTGAAAAGGA GTTCCATTAC
TATCGCGGAA TTATCCTGCA GCGTCTTGAA CGGTTTGTGG AGGCCGAATC GGACTTTGAA
TATGCATTGC AACTTGATGA CGAGTTTGCT GATGCATGGT ATGAACTTGC CTACTGCAAA
GATGTTCTTG GCAAACTTGA GGAGAGCACC TCATGTTACC GCAAAACACA CGATTATGAT
CCATACAATA TCAATGCCTG GTACAATAAT GGCCTGGTAC TGAGCAAAAT GAAAAGCTAT
GACGAGGCGC TTGACTGTTA CGATATGGCC ATCGCCATCT CCGATGATTT CAGTTCGGCT
TGGTATAACC GTGCAAACGT TCTTGCCATC ACCGGACGTA TTGAGGAGGC TGCTGAAAGC
TATCTGAAAA CGCTGGAATA TGAGCCCGAT GACATTAATG CGCTCTATAA TCTCGGCATA
GCCTATGAAG AGCTGGAGGA GTATACCGAA GGCATCATCT GTTATCGGCG CTGTATCGAG
ATCAGCAGTG ATTTTGGTGA AGCATGGTTT GCACTTGCCT GCTGCCATGA AGCGCTTGAA
GAGTACGAAG AGGCATTTAC AGCAACTCTG GAAGCGCTGA AATCAATACC CGACAGCATT
GAGTTTCTGC TGCTGAAAGC GGAGATCGAG TACAATCTCA ACGATCTTGA AAATTCCATG
GCAACTTACC GGCAGGTCAT AGCGCTTGAT CCGGAAAGTC CCCAGATTTG GGTTGATTTT
GCCATCGTTC TCAGGGAAGG TGGCCAGATC AATGCATCAA TCGATGCACT GCAGCAATCA
TTGAAGCTAC AGCCACTTTC GGCTGATGCT CACTTTGAAA TTGCCGCTGC CTATTTTGCC
ATGGGCGACA AACTCAGTAC CCTGAAGGCA CTAAGCAAGG CATTCAAAAT TGACCCCGAG
AAAAAAGAGC TTTTTCAGAG CACTTTTCCG GAACTGTACC AGCAGGATTC TGTCAGACGG
ATGCTCGGGA TTGCATGA
 
Protein sequence
MNVPDFFDDN RHNPAGASGR EQPDLDGLDS MFDSEDLLDR ISQYNEEGFQ LEALAVARRL 
EEIAPFNTET WFHLGNCLTL NGFFDDALEA FQRAAVFSPM DSEMRLNLAL AWFNTGNHDE
ALRELEDILV DSSIEKEFHY YRGIILQRLE RFVEAESDFE YALQLDDEFA DAWYELAYCK
DVLGKLEEST SCYRKTHDYD PYNINAWYNN GLVLSKMKSY DEALDCYDMA IAISDDFSSA
WYNRANVLAI TGRIEEAAES YLKTLEYEPD DINALYNLGI AYEELEEYTE GIICYRRCIE
ISSDFGEAWF ALACCHEALE EYEEAFTATL EALKSIPDSI EFLLLKAEIE YNLNDLENSM
ATYRQVIALD PESPQIWVDF AIVLREGGQI NASIDALQQS LKLQPLSADA HFEIAAAYFA
MGDKLSTLKA LSKAFKIDPE KKELFQSTFP ELYQQDSVRR MLGIA