Gene Plim_2514 details


Gene Information

Locus tag: Plim_2514
Symbol:
ID: 9139225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 3273629
End bp: 3275149
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: carboxyl-terminal protease
Protein accession: YP_003630539
Protein GI: 296122761
COG category:
COG ID:
TIGRFAM ID:
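
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3273629
end_bp = 3275149
gene_length_bp = 1521
protein_length_aa = 506


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1521
print(expected_protein_length(gene_length_bp))  # 506
```

Under those assumptions, both computed values agree with the listed gene length and protein length.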


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTTCAG CTCGGATGCA GTTTCTACCG CCGAATGCGG GATCACTTTC CCACACGGGC 
TGTGCCAGTC GGTGGTTACC AGGCGGCTGG TCTCGTGAGG TTTGGCTGGC TTTGTTCTGC
GGCGCCGTTT TGCTGGCGAT GGCACCGACA TCGCTGGTGC TGGCAGACGA AGAAAAGGCC
ACACCTCCGG ATGCTCGGGA GAAAGAATAT TACGAGTTGA TGAAACTGTT TGTCGATTCG
TTTGAACAGG TCGAACGCAA CTATGTGAAA CCTGTGGATC GCAAGCAACT GCTGGAATCG
GCCTTGAGGG GCATGCTGGC TGACCTGGAT CCGTATTCGT CTTACATCGA TTCATCGGGT
CTGGCTGAAT TCACTCAGCA GGTGGAGCAG GAGTTTGGCG GGATCGGCAT TCAGGTTTCC
CTCGATCCCA AAACGCGGCA GCTCGTCGTC ATGACACCAC TACCCGGGAC ACCGGCCTAT
AAAGCTGGAA TTCGTGCCGG AGATCGAATT CTTTCGATCG CGGGAAAACC GACAGCCGAG
TTCGCCGATG GTAAAGAACT CGAATCAGCC GTCGTGCTGA TGAAAGGTAA ACCCGGCGAG
ATCGTGAAGC TGAGTGTGCT TCACGAGACC GAATCGACAC CCGTGGAGCT GGACGTCGAA
CGGGCCATTA TTCGTACTCC TTCCGTGCTG GGTGATAAGT ACAACGAGGA CGGTAGCTGG
TCGTTCTACC TCGATGGGCT CGACAAAGTG GCTTATGTCC GGCTCTCTCA GTTCGGCAGG
AACAGTGCTG ACGAAATCAA ATCAACTCTG AAGGAGTTGG ATCAGCAGGG GATGAAGGGC
TTGATTCTCG ATCTGCGATA CAACCCGGGC GGGCTGTTGA CGGCCGCCAC TGAAATTGCC
GACATGTTTC TTGATAGTGG TGTGATTGTG AGCACTCGTG GTCGCAATAC CGAGGAACAG
GTTTTCAAGG CAAAGAAGTC GGGGACGTTT CGCGATTTTC CTGTTGTTGT GCTGGTGAAC
CGCTTCAGTG CTTCCGCCAG TGAGATTTTA AGTGCCGCAC TGCAGGATCA TGATCGAGCG
ATCATTGTGG GTGAGAGATC GTGGGGGAAA GGGAGCGTAC AGAATGTCAT CAATGTCGAT
GGTGGCAAGA GCGCACTCAA GCTGACCACA GCCAGCTATC ATCGCCCCAG TGGTAAGAAC
ATTCATCGCT TCCCTAATTC GAAGGAAACC GATGAATGGG GTGTCAAACC CACCGAGGGC
TATGAAGTGA AAATGACACC TGAGCAGATG CAGAAATATC TGGAATACCG TCGGCAGCGC
GATGTCCTGC GGAAAGAAGC CGGAGCTTCG ACGTTTGATG ATCCTCAGTT GGCAAAGGCT
GTGGAGTATC TGAAGTCCAA AATCTCGCCG GCTGCAGAAG CCAAGCCAGA AGAAGACAAA
GATAAAGCCA AGGCTGAAAC ATCCCCGGCT GGGACTGCTC CTCAAGGTGA TTCTGCTTCC
AAAGAAAATT CTGCCAGGTA G
 
Protein sequence
MFSARMQFLP PNAGSLSHTG CASRWLPGGW SREVWLALFC GAVLLAMAPT SLVLADEEKA 
TPPDAREKEY YELMKLFVDS FEQVERNYVK PVDRKQLLES ALRGMLADLD PYSSYIDSSG
LAEFTQQVEQ EFGGIGIQVS LDPKTRQLVV MTPLPGTPAY KAGIRAGDRI LSIAGKPTAE
FADGKELESA VVLMKGKPGE IVKLSVLHET ESTPVELDVE RAIIRTPSVL GDKYNEDGSW
SFYLDGLDKV AYVRLSQFGR NSADEIKSTL KELDQQGMKG LILDLRYNPG GLLTAATEIA
DMFLDSGVIV STRGRNTEEQ VFKAKKSGTF RDFPVVVLVN RFSASASEIL SAALQDHDRA
IIVGERSWGK GSVQNVINVD GGKSALKLTT ASYHRPSGKN IHRFPNSKET DEWGVKPTEG
YEVKMTPEQM QKYLEYRRQR DVLRKEAGAS TFDDPQLAKA VEYLKSKISP AAEAKPEEDK
DKAKAETSPA GTAPQGDSAS KENSAR
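
The sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch, not the database's own tooling. It joins the blocks, computes a GC fraction, and translates codons up to the first stop. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it permits). All function names are hypothetical. Only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for ten-character blocking."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGTTTTCAG CTCGGATGCA GTTTCTACCG CCGAATGCGG GATCACTTTC CCACACGGGC"
last_line = "AAAGAAAATT CTGCCAGGTA G"

print(translate(clean_sequence(first_line)))  # MFSARMQFLPPNAGSLSHTG
print(translate(clean_sequence(last_line)))   # KENSAR
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would allow a comparison with the GC content reported in the record.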