Gene Plim_4099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4099 
Symbol 
ID9140819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5256441 
End bp5258186 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content55% 
IMG OID 
ProductNHL repeat containing protein 
Protein accessionYP_003632109 
Protein GI296124331 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.649178 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGACTT GTTCTAACCG AGCCTGGTTT CGACATTTTG AGAGTTGGCC GCAAAGATGC 
GGAGTCTGGC GATGGGATGG GAATGCTCTG GTAGGGTGCC TGGCTGCGAT CTTACTGTGT
CTATTTTTCG TGCCTGCGGT TGCTCAGGAA ACGTCTGACG GGCAATCACC GGCAGCGAGT
ACGCCTAAGC CAGCACCAGA AGCCGGCAAG CCAGAAAATC CGTTTCCGAA TCGCATTCCT
GCCCCTTCGC TCGATGGGGG AATCGAGTGG CTCAACACCA GCCAGCCCCT GTCACTGAAG
GATCTGCGCG GGAAAGTGGT GGTGCTCGAC TTCTGGACGT ACTGCTGCAT CAACTGCATT
CATGTGCTGC CTGATCTGAA GTATCTCGAA AAGAAGTATG GCAAAGAGCT GGTGGTCATC
GGTGTTCACT CCGCCAAGTT TGATAACGAG AAAGAGTCCG GGAACATTCG CAAAGCCATC
TTGCGTTACG AGATTGAGCA TCCCGTCGTC AACGATGCGG AGATGACCAT CTGGCGGAAG
TTCAGTATTC GGTCGTGGCC TTCTCTCGTG TTGATTGATC CTGAGGGGCA GTTTTGTGGT
GTCGCTTCCG GCGAGGGAAA TCGCGAACTG CTGGATCAAG TGATTGCCAA AGTCATCGAT
TATCATAGGG CGAAGGGGAC GCTGAACGAA AAGCCGATGG CTTTCGATCT CGAAAGCGGC
AAAGAAGCAG CCACTCCTTT GCGATTCCCT GGCAAGCTGC TCGTTGATCC GGCCCATGAG
AGAGTCTTTA TTTCAGACAG CAATCATAAT CGCATCGTCG TGGCATCGCT GGCCGGTCAA
CTCCTCAAGG TGATTGGAAG TGGAAAAATT GGCGCCAAAG ATGGCCCGGC TGAATCGGCA
CAGTTTGACC ATCCGCAAGG AATGGCACTG GACGGGAATA CGCTCTATGT GGCCGATACG
GAAAATCATC TGCTGCGGAC GGTGAACCTG ACCACATGGG AAGTTTCGAC ACTCGCAGGG
ACTGGTGAAC AGGCCCGCGG CCGCGATCGT GGGGGGGAGT TGCGAACCAC AGCGCTGAAC
AGCCCGTGGG ATCTTTACAT CCAACAGGGC GTGCTGTACG TCGCGATGGC TGGGCCGCAT
CAGCTCTGGT CGCATGCACT GGGAAGTAAG ACGATTCAGA ACTATGCGGG CTCTGGACGG
GAAGATATTA CCAATGGAAG CCTGGCTCAA TCGGCACTGG CGCAACCTTC GGGAATCACC
AGTGATGGCG AGTCGCTGTA TGTGGTCGAT AGCGAAGGTT CATCCATTCG CAAAATCACT
ACTAGCGAAG CAGACAAACT GGAAGACCCG GAGGGCAAAG TCACCACAGT GGTGGGAGCT
TCGGATCTGC CGCGAGGTGC GAGCCTGTTT GAGTTTGGCG ATATTGATGG CAAGGGATCA
GCAGTTCGTC TGCAGCATCC GCTGGGGATT GTCTTTCACG AGGGGAAGCT GTTTGTCGCC
GACAGTTACA ACCATAAGAT TAAAGTGATC GATCCGATCA AAAGAACATG CGAGAGCTGG
CTGGGGAATG GAAAGCCGGG GGCTGCACTT GCTCCGGTCC AGCTATCGGA ACCTGCGGGG
TTGGCAACTT ATGGCGGAGT TCTGTTCATT GCCGACACGA ATAACCATCG CGTCCTGAAG
GTCGATTTGA AAACGAAAGC TGCCACCGAG TTGAAGATCG AAGGCCTGAC AGCCCCCAAG
CCTTGA
 
Protein sequence
MMTCSNRAWF RHFESWPQRC GVWRWDGNAL VGCLAAILLC LFFVPAVAQE TSDGQSPAAS 
TPKPAPEAGK PENPFPNRIP APSLDGGIEW LNTSQPLSLK DLRGKVVVLD FWTYCCINCI
HVLPDLKYLE KKYGKELVVI GVHSAKFDNE KESGNIRKAI LRYEIEHPVV NDAEMTIWRK
FSIRSWPSLV LIDPEGQFCG VASGEGNREL LDQVIAKVID YHRAKGTLNE KPMAFDLESG
KEAATPLRFP GKLLVDPAHE RVFISDSNHN RIVVASLAGQ LLKVIGSGKI GAKDGPAESA
QFDHPQGMAL DGNTLYVADT ENHLLRTVNL TTWEVSTLAG TGEQARGRDR GGELRTTALN
SPWDLYIQQG VLYVAMAGPH QLWSHALGSK TIQNYAGSGR EDITNGSLAQ SALAQPSGIT
SDGESLYVVD SEGSSIRKIT TSEADKLEDP EGKVTTVVGA SDLPRGASLF EFGDIDGKGS
AVRLQHPLGI VFHEGKLFVA DSYNHKIKVI DPIKRTCESW LGNGKPGAAL APVQLSEPAG
LATYGGVLFI ADTNNHRVLK VDLKTKAATE LKIEGLTAPK P