Gene Plim_4229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4229 
Symbol 
ID9140951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5403072 
End bp5404265 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content53% 
IMG OID 
Productpeptidase M20 
Protein accessionYP_003632236 
Protein GI296124458 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.543782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGAC TCGGTGCACG GGCGTTAGAA CTGGCGATGC AACTGATTCG CATCCCGACG 
GTCAGCCGAG ACAGTAATCA TGCCGCGACG TTGTTTCTAG AAAACTGGTT GAAAGAGCAT
GGCTTTATCA CGGAAGTTCT CACATACCAC GATTTTAAAG GTGTTTTGAA ATCGAGTGTG
ATCGGACGAA GAGGCCCCGC AAAATCGACG GGTGGCGTGG CCTACTTCTG CCACACCGAT
GTGGTTCCTG CCGAAGGATG GGTGGGCCTT AAAGAGAGTG ATTCTCTACA GGGCCCGACT
CAACCACAAC AACCTTTTAC ACCCGTCGTC ATGGGTGACC GCCTGTATGG GCGTGGTGCC
TGTGATATGA AAGGTTCAGC CGCCGCTTTT CTCGCAGCGA TTGAGCAATG CCCTGTCGAG
GAACAGGCAG CACCGATTTA TGTGGTCGCC ACAGCTGATG AAGAAGTGGG ATTCTATGGA
GCCGCCGATG TGGCTGCTCG ATCACAACTT TATCGACAAC TCGTGGACGA AAAAGTGGCT
GGCATTATTG GTGAACCCAC CGAGCTGAGT GTGGTTCACG CACATAAGGG GATGTACGTT
CTCAAGGCAA CCTCTTCAGG CAGAGCCGCC CATTCGAGCA CACGCGAAGG TCTCAATGCC
AATCTGGCGA TGATCGATTT TCTGTATGAA ATGAAGCGAT TGCATGACCA GACACTGACT
GATCCAGCCT GGCTCGATGC ACGCTTCGAT CCACCCTGGA TCAGTTGGAA CATAGGGATT
AACGACTTCA CGCATGTGGT GAATATGACG CCCGCACAAA GTGTCTGTAC GGTCTCGTTT
CGTCCCATGC CGGATCAGCA GCCGGATGAA CTGGTGGCGC AGGTCGAACA GATTGCGGCT
GCCTGTGGAC TGACGTTTGA AGTCATTCGC CGCGGACAAC CACTCTATCT CGACCCGGAA
TCGCCCTTTG TCAAAACGAT GTGCGAGCTT TCGGGGTCAG GATCATCGCA GACAGTCAGT
TACGGTACTG ATGGCACGAT GTTTACAGAA ATCGAACAGA TGATCGTCCT GGGCCCGGGT
TCGATTCGAC AGGCACATAC TGCCGATGAA TTTATCACTT TAGAGCAACT ACAAAGCGGG
GCCGAACTTT ACAGCCGAAT TATCCGGCAA CTGGTCTCAA ACCAGAGTGA GTGA
 
Protein sequence
MNGLGARALE LAMQLIRIPT VSRDSNHAAT LFLENWLKEH GFITEVLTYH DFKGVLKSSV 
IGRRGPAKST GGVAYFCHTD VVPAEGWVGL KESDSLQGPT QPQQPFTPVV MGDRLYGRGA
CDMKGSAAAF LAAIEQCPVE EQAAPIYVVA TADEEVGFYG AADVAARSQL YRQLVDEKVA
GIIGEPTELS VVHAHKGMYV LKATSSGRAA HSSTREGLNA NLAMIDFLYE MKRLHDQTLT
DPAWLDARFD PPWISWNIGI NDFTHVVNMT PAQSVCTVSF RPMPDQQPDE LVAQVEQIAA
ACGLTFEVIR RGQPLYLDPE SPFVKTMCEL SGSGSSQTVS YGTDGTMFTE IEQMIVLGPG
SIRQAHTADE FITLEQLQSG AELYSRIIRQ LVSNQSE