Gene Plim_1523 details

Gene Information

Locus tag: Plim_1523
Symbol:
ID: 9138223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 1964036
End bp: 1965307
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: peptidase M50
Protein accession: YP_003629555
Protein GI: 296121777
COG category:
COG ID:
TIGRFAM ID:
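The coordinate and length fields above are related by simple arithmetic. The following minimal sketch checks that relationship, assuming the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length is consistent with that reading) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1964036
END_BP = 1965307


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1272
print(protein_length(length_bp))  # 423
```

Both results agree with the Gene Length (1272 bp) and Protein Length (423 aa) fields.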


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.379776
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGATCCAC TCCAGCCCAT GGCTCCCAAG AAGTCATGGT TTGGAAGGAA ACAGCCTCAG 
GAAACGTCTG TCGATCAGGC TTTAGCGCAG GTCGATCCAC GACAGACAAA TCCGTCAGCG
AAGCCTTCGA CTTCGCTGCC TGAAACTGCA CTGAATACGG CCAACTCGAA GAGCAACTCA
CAGGTTCTCG AAACGAGTGT TCAATTTCCA GGTACCAAAC CCGCGAAGTC TCAGCGTGTT
GCACTCGACC GTGAGATGCT CCCAGGTGAG TTGGCGAGAA TGGCTGCGCT GCAGGCAGCT
GCCGCTCCTG CAAATGCCGC CAATACAGTC GATCACGAAC TGAAAGACGA CGACTTTCTG
GAGGAAGAGA ACGAGCAGCT TCGTCGAGAG GGCGAACTGG AAGCGAAGCG TCTGGCAGAG
CAGGAAAAAC AGTTTCGATC AAGACATTGG GCGATCTGGC TGTTTTTTAT GACCTGTGTG
ACGACGTTTC TCGCGGGCTT GAACAGTGGT GGCGGGCCAG GCCCGGCGAT GCCGGAAGAT
CTCCTGCGTG ATGGGCTCAT CTACGCGGGT TGCGTCATGG CAATCTTGTG TGCTCATGAA
CTGGGCCATT ACCTGCAAAG CAAGCGGCAC GGACTACCGT TCTATTATCC GTATTTCATT
CCGTTTCCCT TTGGGATCTT TGGAACATTA GGAGCGACGG TTTCTTCCAG TAAAATCAAA
TTGTCACGAT CAGCGTTATT GGACATCGCG ACGACCGGCC CGCTATTTGG CCTGATTGTC
ACTCTTCCGA TCGCACTCTA TGGTGCAGCA ACATCCATCC CGTTTCCGGC GAACGCACAA
GCCGGTTTTT CACTCAGCGA TCCACCAATC CTGCGATGGA TGATTGTGCT GACGCATCCA
CAGTTGGGGC CCAATGACGA TGTTCTCATT AACCCCGCCT TGCTGGCGGG GTGGTTTGGC
ATTTATTGGA CAGCTCTCAA TCTGGTACCA ATTGGTCAAC TGGATGGCGG TCAGATCATG
ACCGCACTGA TTGGCAGACG GGCGGAAATT GTGAGCAAAT TGACCATCGT AGTGGCAGTC
ATCTGGATGC TGTACTCGCT CGATCTCACC TTCAGCACCA TGCTGGGATT GATGGCGTTT
ATGGGCATTC GCAGTGCTGA GATTGAGAAT GAGTCTGTAG CACCCAGCAA AGTCAAAATG
GCTCTCGGCT GGCTGCTTTT AGCCTTTTTA TTGATTGGCT TTACACCCAT CCCATTCCGC
ATGGCTCCCT GA
 
Protein sequence
MDPLQPMAPK KSWFGRKQPQ ETSVDQALAQ VDPRQTNPSA KPSTSLPETA LNTANSKSNS 
QVLETSVQFP GTKPAKSQRV ALDREMLPGE LARMAALQAA AAPANAANTV DHELKDDDFL
EEENEQLRRE GELEAKRLAE QEKQFRSRHW AIWLFFMTCV TTFLAGLNSG GGPGPAMPED
LLRDGLIYAG CVMAILCAHE LGHYLQSKRH GLPFYYPYFI PFPFGIFGTL GATVSSSKIK
LSRSALLDIA TTGPLFGLIV TLPIALYGAA TSIPFPANAQ AGFSLSDPPI LRWMIVLTHP
QLGPNDDVLI NPALLAGWFG IYWTALNLVP IGQLDGGQIM TALIGRRAEI VSKLTIVVAV
IWMLYSLDLT FSTMLGLMAF MGIRSAEIEN ESVAPSKVKM ALGWLLLAFL LIGFTPIPFR
MAP
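
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of the permitted start codons and the initiating residue is methionine. The sketch below illustrates this on the first 30 bases of the gene sequence, and also shows one way a GC percentage could be computed. It is a minimal example under stated assumptions, not the database's own pipeline: the helper names `clean`, `translate_cds` and `gc_percent` are hypothetical, and the codon table is built by hand from the standard amino acid assignments, which table 11 shares with the standard code.

```python
# Codon table in the conventional TCAG ordering. Translation table 11 uses the
# same amino acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at a stop."""
    seq = clean(seq)
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a positive multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    residues = ["M"]  # the initiator is methionine, even for GTG
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
first_blocks = "GTGGATCCAC TCCAGCCCAT GGCTCCCAAG"
print(translate_cds(first_blocks))  # MDPLQPMAPK
```

The output matches the first 10-residue block of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` should allow the same comparison against the complete protein sequence and the GC content field.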