Gene Ppha_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_1121 
Symbol 
ID6462683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp1155112 
End bp1157094 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content50% 
IMG OID642727367 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_002018017 
Protein GI194336223 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACATTA TCCATCCGAT AAACGATTTT TTGTTTGAAC TTTTCCCCAA GGCAAAAGAG 
GCGGGAAACA ATATTGATGT ACTGAAGCAG GAGATTGAGA CCTTTTATAC GGTAGGGCCT
TTCAAGCCAG CGATTACCAT CGTGCACGGC ATCATCGATA TCACGATTGA GGGTGACCTG
ATTGCACAGC ACAACAGCCG CTATAACAAA GTCCTTGCAC TCTGTGATGT CCGCCAATAT
ACCGAGGCCA AAGAGCAGAT CAGTCAGCTC ATCAAGGAAG CTCCTCATAT TTCAGAGTAC
CACCGGGTAC TTGGCCAAAT CCTTTCAGAG CAGGGCGAAC AGGATGATGC CATCAACAGC
CTTATCGATG CACTGCGCTG GGACCCGAAA AATGGGTGGG CGCTTATTAT TACGGGCAAC
ATTATGGCCC GCTATCAGCA CGACCTTGAG ACGGCTATGA AATACTATGA GTGCGCTCTG
CAGCATAAAC CCGACGATTA TGTTTCCATG ACGCTGATTG CCATCAACCT GATCCAGAAC
GGGAATCCGG ATGCAGCACG GCAATACCTC GACAGGGCTT TTGCGGTTAA TCCCGCTTAT
CCGAATATCT ATTATGCCTT TGCCCTCCTT GCAGAGGCTG AACATCATTC CAGGGAAGCG
TTTGAGATGT CGATTGTTGC GCTTTTCAAG AATCCCAAAA AGGATCTGCT CTATCAGCAA
TCGCTGAAGA GCGCCATGAA GGCGGCAAGT GAGATTATTG ACGAAGAGGT CGGTGGTGAT
CTTGTCAATG CCTATGCCTC GACGCTGGAG GAGGAGTGCG GCAAGAAAAT CGTTATCGAA
ACGGATACCG GGTTAAAGAG CGCAGCCAAA ATCGAGTGTG CCGAAAACCA CGACCGGAGC
TACCACCTCG TCAAATACAA TCCAGACTAC CCGGCGGTTC ACCATCTGAT CATGCATGAG
CTCGCGCATC TGCATTTTGC GGCTCAGGCA CGACAGACAG GGAAAAACAA GCTCTTTGTC
AGCAATGATG AGAACAAAAA GCGATTCCTG GGTTCCCATG AAAAAGATGC CAGGATGTTG
AGCAAAAAAG GTTACGACGA TGCGCTCATC GAGAACTATT ATGCCTTGTT TCAGGTGCTG
AACGCCCAGA TCTTCAACAC GCCGATTGAT CTTTATATCG AGGATTATCT GTTTCAGAAC
TATCCGGCTC TCATGCCCTA TCAGTTTCTT TCGCTGCTTG GCCTTATACA GGAAGGAATT
TACGCAACGA CCGATGAACA GATTCTGACC ATTGCACCAC CGGAAATCCT CTCGAAATCA
AAGATATTTA ATCTGGTCAA GGCACTGCAC TTTCAAAAGC GGTATGGGGT GAACCTTATA
GAAGAGCACA AGCCAACCAC CGCAGAAAAA GAGCAGGCTG TAACCTTCTA CCGCGAATAT
GAGGAGTGCC GAACCCGGCA CACGCCCGGC GATGAATATG AACTGCTTCA GCGGTGGGCA
AAGCAGTTAT ATCTGCAGAA CTTTTTCGCA CTCATTGAAG AGCCCGATTA CCGGGAGCAA
AGCGACCTTA TCGAGCGCCT TTGCGCTGAA GTGCAGGTCG ATTTGCCCGG CAAAGGTGAT
GCCGCCCATC CGGCAGACCA GAAGAGGCAG CTCTGCACCG AGGCGCATCA GGGCAAAGAT
ATTAATATGG CCGTAACCCG GTTCATGGTC GAAGCACTGC ATTACTTCAA AAATCTGACC
GACTCGGAGA TCACTGCCAT TGCCATCCAA ATAGGCCTGA TGTGTGGGGA GGGAATAAAC
CCGGATGCAG AAGGCTACAC CATCCCGCTC ATAGCCGCCA GAACCTTCAC CGGATACCAG
GTGCTGGCCT ACTATTACGT GAGCTGGGCA AAGGCATTCC CTGAGTACCT GCAGCAACTC
CAGCTCCCGT TTGACAAAGA GTACGAATTT GCTCTTGAAA TGTTGGGGAT GGGAGGGGGA
TGA
 
Protein sequence
MHIIHPINDF LFELFPKAKE AGNNIDVLKQ EIETFYTVGP FKPAITIVHG IIDITIEGDL 
IAQHNSRYNK VLALCDVRQY TEAKEQISQL IKEAPHISEY HRVLGQILSE QGEQDDAINS
LIDALRWDPK NGWALIITGN IMARYQHDLE TAMKYYECAL QHKPDDYVSM TLIAINLIQN
GNPDAARQYL DRAFAVNPAY PNIYYAFALL AEAEHHSREA FEMSIVALFK NPKKDLLYQQ
SLKSAMKAAS EIIDEEVGGD LVNAYASTLE EECGKKIVIE TDTGLKSAAK IECAENHDRS
YHLVKYNPDY PAVHHLIMHE LAHLHFAAQA RQTGKNKLFV SNDENKKRFL GSHEKDARML
SKKGYDDALI ENYYALFQVL NAQIFNTPID LYIEDYLFQN YPALMPYQFL SLLGLIQEGI
YATTDEQILT IAPPEILSKS KIFNLVKALH FQKRYGVNLI EEHKPTTAEK EQAVTFYREY
EECRTRHTPG DEYELLQRWA KQLYLQNFFA LIEEPDYREQ SDLIERLCAE VQVDLPGKGD
AAHPADQKRQ LCTEAHQGKD INMAVTRFMV EALHYFKNLT DSEITAIAIQ IGLMCGEGIN
PDAEGYTIPL IAARTFTGYQ VLAYYYVSWA KAFPEYLQQL QLPFDKEYEF ALEMLGMGGG