Gene Plim_0142 details


Gene Information

Locus tag: Plim_0142
Symbol:
ID: 9136796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 191177
End bp: 192547
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: protein of unknown function DUF323
Protein accession: YP_003628193
Protein GI: 296120415
COG category:
COG ID:
TIGRFAM ID:
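
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of this database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(191177, 192547)
print(length)                     # 1371
print(protein_length_aa(length))  # 456
```

Both results match the Gene Length and Protein Length fields of this record.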


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.741056
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATGGC TTATCGAACA GGCTGGTTCA ACAAATCCCA CTGATTCCGC CGCACGAGTT 
GCGAACGGCT GGCAAGGAAT CAGGCTGTGC GCCCTCCTGT GGCTGGCCCC CTGGTTGCTG
TCAGGTTGTG GTGGTGGCAC AAAACCACCA GCACCAGCAG CTCCGGTGGC TCCACCACCT
GTCGCCATCA AAACGGTGAA GCCCGCTGCT CCTGAACCCG CTGAGAATCC GCCGGGCGAA
GCCGCACCCG TCGAAGGCGA AGCCACAGCC GGCCCGGCCA TGGCTCCACC GGGAGAACAT
CCCGCAAATG TGTTTGACTT CGTCGCACCT GGCTCTGTCA ACAGCGTGAC ATCATCAACA
CCACTTCCGC ATGAAATCGA TCAGTTTGTG ATTGCACAGG TGGCAGATCA GGCAGGTGCA
ACCAGCTTTG TCGTGACCGA AATTCCCGCC ACCACCGAGA CGGGAAGCTC GATCGATGCT
TCGGGAAACC CTGCGAACCT CCAGGGAAAT CGTTTACCAA TTGGGTTCAT GTCCGTTCCG
GGGACAGGGT TATCGCCGGA TGGCTGGCCT AAGCGAATCA TCTGCCAGTA CGACGGCAGC
CTGATGGCCT ACATCCCACC CGGGCCTGCC AGGCTCGGCT CCAATGACGG CCCCGCCAAT
GCCCGGCCTG AAGCCACGGT TCTACTGGAT GGCTATTACA TCAATGTGTT TGAAACAACG
GTCGCTGAGT ACAAACGCTA TCGCGATGAG ATGAAGGCCA AGAACAAGAA CAGCTTCGCT
GCGATCAATG AGACGGCTGA TCCACGCCAG CCCGTACTGG GGATTCCGTG GGGTGTGGCC
AGTGCCTACG CCAAGTGGTC TGGTCGGGAA CTTCCCACAG AGGCGGAATT TGAAAAAGCC
GCCCGTGGCC CGGACGGATT TCGAGCTCCC TGGGGCAATA CCCGGGCGAT CTGGCCCGAG
CCGCGAACGA CCAAAACATT GGCCAACGTC GGCAAGTTCA GCAGCGATCA GAGCATTTAC
GGCATCTACG ACCTGGCGGG AAATGCTCAT GAATGGGTCG CCGACTGGCA CGACGACAAC
AGCCACGCCG AAGCGGCCAA GTCGCGAGAC GGCGTGAAGA ACTGGACAGG TGCCAAGAAG
CCAAAAATTA CCAGCCAGCA CACCGTGAAA GGCTGTTTAA GCGATTGGGA TGTCACAGCC
CGGGAAGGTC GATTGATGAC CGACAAATTC CCCGATGTCG GTTTCCGCAC AGTCCTGCGC
GTTGGAGGGG GAAATCCCGG ACAACCTGCA GCCACTCCTC CCAACACACC CAACACGAAG
CCCGCAAACC CCAACCGCCC CCCCAATCCA CCACGCAACA ACGCATTTTA G
 
Protein sequence
MAWLIEQAGS TNPTDSAARV ANGWQGIRLC ALLWLAPWLL SGCGGGTKPP APAAPVAPPP 
VAIKTVKPAA PEPAENPPGE AAPVEGEATA GPAMAPPGEH PANVFDFVAP GSVNSVTSST
PLPHEIDQFV IAQVADQAGA TSFVVTEIPA TTETGSSIDA SGNPANLQGN RLPIGFMSVP
GTGLSPDGWP KRIICQYDGS LMAYIPPGPA RLGSNDGPAN ARPEATVLLD GYYINVFETT
VAEYKRYRDE MKAKNKNSFA AINETADPRQ PVLGIPWGVA SAYAKWSGRE LPTEAEFEKA
ARGPDGFRAP WGNTRAIWPE PRTTKTLANV GKFSSDQSIY GIYDLAGNAH EWVADWHDDN
SHAEAAKSRD GVKNWTGAKK PKITSQHTVK GCLSDWDVTA REGRLMTDKF PDVGFRTVLR
VGGGNPGQPA ATPPNTPNTK PANPNRPPNP PRNNAF
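
Both sequences are printed in blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the cleaned DNA. It is an illustration under stated assumptions: the helper names are hypothetical, the codon assignments are those shared by NCBI translation tables 1 and 11, and alternative start codons and ambiguous bases such as N are not handled (an unknown codon raises KeyError).

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCATGGC TTATCGAACA GGCTGGTTCA ACAAATCCCA CTGATTCCGC CGCACGAGTT"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MAWLIEQAGSTNPTDSAARV
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `clean_sequence` and `gc_fraction` would allow the same kind of comparison against the GC content field.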