Gene Plim_3586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3586 
Symbol 
ID9140304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4617023 
End bp4620256 
Gene Length3234 bp 
Protein Length1077 aa 
Translation table11 
GC content55% 
IMG OID 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003631598 
Protein GI296123820 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGCT GTGTCCCACA GGACGGCTCG ACAAGAGGGT ATGACCTCGG TGCCATTAGG 
GCCAAAGCGC GCCTGGCCTG CTCAAGCCTC TTATGGCTGG CCATGATCTG TGGTTTCCAA
TCGTTCGCTC TTCAGCCAAC GTTGGCACAG CCGCCTGCCA ATTCAGCAGA CCTGAGTGAC
TTTAATAACG GCGTGGGATT GTACCGCCAG AGTCGTTGGG GAGATGCTGT CGAATCCTTT
CGTCAATTCA TCAAAGCCAA CCCTCAAAGT CCGCGAGTCC CGGAATCGCA AATCTACATT
GGTCTGGCTC TGATCAACCA GCAGAACTAT GTCGAGGCCC GGACCGTCCT GCGAGAGTTT
CTTAAAAACT TCCCCGAAAA CAGCAACGTC GCTCAGGCAC GTTACCGGGT GGCGGAATGC
AGTTTTTTGC TCAATGATTT CCCGGCTGCG AAGCAGGAAC TGCAAAGCTA TCTGGAAAAG
TATCCCCAGG ATGCCCTGGC ACCACGGGCA CTTGCTTATC TGGGAGATGT ACAGCTCCAG
CTCAAAGACC CGCAAGCTGC GATCACCACC TTCGAGGAAG CCCGTAAGCG TTTCCCAGCT
GGTGCCCTTG CGGATGACAT CGAGTATGGA CTGGCTCAGG CCTATTCAGC GGCTGGCAAA
ACCGCAGAAG GACAAAAGCT GCTCGATGCC ATCGCCGCCA GACAGAATCA TCCTCATGCA
GCTGATGCTT TACTGCTGCT GGGAAATCAA GCCTCGACAG CCAAGGATTA TCCCGTTGCC
ATTCGCCAGT TTGAATTGCT GGCCGAGCGA TATCCTCAAA GCCCGCTCGC AGAGACAGCA
CTCACTAACC GCGGCTATGC GCTCTTTCAG ACCGGTCAGT TCGATGCCGC AGCCGCGCAG
TTTGAAAAAC TTGCCGCTCA GTTGGAAAAG ACGTCATCGA CATGGACGCC TCAGCAGAAG
CAACAGGCGG CGAGCTATCT GTACTGGCAA GGGTTGAGCC AGAAAAATGG CAATCAACTG
GAAGCTGCCC TCGTTACTTT TGCAAAGAGC TTTGACCTGG CCGGTGGATC GTCCATCGCG
GAAAGCGTGT TGTATCAGCA GACACTTACC GCCAGACAAC TGGGACAAAC ACAAAAGGCA
GAAGCGCTGG CACTCCAGCT TGTAGAGAAG TGGCCGCAAG GTGATTCCGC CGATGATGCG
TTGCTGATGG TGATTGATCT CTCGCTGGAT CGTCAGGATG CCGCCAAGAC TCAGAGCTTG
ATCGCGCAGT TTCGCAAGTC CTTCCCGGAA AGCTCACTCA AATGGCATGT GCGACTGCTG
GAAGGTCGGC GGGATCTTGA AGAAGGGACC CGCACGAGCA ATGTGGAAGC TCTCCAGCGG
GCTCAAGTGG CCTTTGAAGA AACGCTTCAA GGAGCCACAT CGATCGAACT GAAAGATCAG
GCCAGATACT TTTCAGGACT GGTTCTGCAA CTGCAGGGCA AACTTCCCGA AGCGCGGGAA
ATGATTGCCC CTCTGGCGGC ACAGATCAAT GCACAAAGTG TTCGCCCGGA GATTATCGAA
TCGCTCGTGA TCGAGATGTC AGTCCTCTTC AGCCTTGGCG ACTTTGAAGC CAGTGCTGCG
GTCGCAGGTA AATACCTGAG TATGTTCCCT CAGGCCACTC AGCGAAGGCG GGCCTATGCG
CTGCAAGGCC TGGCATACGC CAAAGCGCAA CAGTGGGCCA AGGCCGAAGC TGTGATCAAG
CAGTTTGAAG CCGAGTTTCC TGGTGATCCT GCGGTCGCTG CCGCTTTGAT GGATCAGGCC
GAAGTGGCAG AAGCTGCCAA GCAATGGCCA GTCGCTCTCG CTGACTTCGA GAAACTGAAG
CGACTGGCCG CAGGTACGAC CAATGAACCA TTTGCGTGGC GAGGAACAGG CTGGTCGAGA
TTTCGTCTGG GCGATTACAA ACTGGCGGCT GAAGAATTTG CCCAGTTATC GGCAAAATTC
CCGCAGCATC CGCTTCAGGC CGAGGCGATG TATTACGAAG GGGAGTCTTG GCTGCTGGCG
AAAGAGACCG AGAAAGCACT CAAGGTCTTT CAACAGGCTT TTGATCGCTT CACGCCCAAA
GACGCCGCCA GCGTGAAAGA GGAATTGAAA GCCCCGGTGC TGTTTGGATA TCGCTCGGGC
CTGATGATGG CCCGCACTCT CGAACAGACC GGGCGACTTG AACAGGCCGA TCAAGCGTAC
GAAACCTTGC TCAAGAAGTT TCCCAAAGCG GAAGTGTTTG ATCAGTTACT CAATGAATGG
GCTCTGATCA ATTATGAGGC AGGTCGATTT GAGCAGGCCG ACAAGATTTT TGCGCGCTTA
GTCGCTGAAT GTCCGGAAAG CCCACTGGCT GATAACGCGA AGCTGAGCCT GGCGGAGAGC
GACTTGATTC AAGGTGAATT TGCGCGGGCG AGAAAATCAC TCGAAGAACT GCTGGAAAGT
GAACAGTCCG ATGTCAGTGT GAAAGAACGG GCTCTTTATC AACTCGTCGT GCTCGCCATC
GAACAGCAGC GCTGGGATGA CGTCAGAAAA CTGGGTGGTC AGTTTGTGGC CACCTATCCC
GAAAGCCCCC AAAAGCTGCA AGTCGCAGTG GCACTCGCTG AATCTCAACT GGCCACTGAT
CTCAGTGTGG AAACTGCCAC CGCGCTACTT CCCCAACTTG ATACTCTGCG CGAAACCATT
CGTGCTGCGA GAAAACAGGG CGAAATCGAA GACTGGCATG GCAGGCCCTG GGTACTCTGG
GCAGAGGCCC GGTTACGCCA TCGCAACTAT GAAGGTCTGA CAGACGCTGC GGAAGAACTT
GATGGCTGGC AACCACCCGG ACCTGCCCGC TGGCAGATTC GAGAAGTTCT TGGCCGAGCC
TACAAACAGC AAGCCAAATT TGATGAAGCC CGAGCAGCTT TTCAGATCGT GACCAAGGAC
CCGCAAAGCC AGCAGACAGA ACTCGCTGCC AAGGCCCAGT TTCTGCTCGC AGAGACCTAC
TTTCTCCAGG AGAACTGGAA GCAGGCATTC CTGGAATACC AGAAGGTCTA CTCCAATTAC
GCCTTCCCCG AATGGCAGGC GGCCGGTTTA CTTCAGGCAG CCAAATGCGA CGAACAGCGT
AGCCAGTGGA AGGAAGCCAT TGCGACCTAC GAACGGCTGC TGCGCGAGTT TCCGAATGTC
AGTTACGCCA CCGAGGCGAA AGAGCGACTG GAAGCGGCTC GCAAACGCAA TTGA
 
Protein sequence
MNRCVPQDGS TRGYDLGAIR AKARLACSSL LWLAMICGFQ SFALQPTLAQ PPANSADLSD 
FNNGVGLYRQ SRWGDAVESF RQFIKANPQS PRVPESQIYI GLALINQQNY VEARTVLREF
LKNFPENSNV AQARYRVAEC SFLLNDFPAA KQELQSYLEK YPQDALAPRA LAYLGDVQLQ
LKDPQAAITT FEEARKRFPA GALADDIEYG LAQAYSAAGK TAEGQKLLDA IAARQNHPHA
ADALLLLGNQ ASTAKDYPVA IRQFELLAER YPQSPLAETA LTNRGYALFQ TGQFDAAAAQ
FEKLAAQLEK TSSTWTPQQK QQAASYLYWQ GLSQKNGNQL EAALVTFAKS FDLAGGSSIA
ESVLYQQTLT ARQLGQTQKA EALALQLVEK WPQGDSADDA LLMVIDLSLD RQDAAKTQSL
IAQFRKSFPE SSLKWHVRLL EGRRDLEEGT RTSNVEALQR AQVAFEETLQ GATSIELKDQ
ARYFSGLVLQ LQGKLPEARE MIAPLAAQIN AQSVRPEIIE SLVIEMSVLF SLGDFEASAA
VAGKYLSMFP QATQRRRAYA LQGLAYAKAQ QWAKAEAVIK QFEAEFPGDP AVAAALMDQA
EVAEAAKQWP VALADFEKLK RLAAGTTNEP FAWRGTGWSR FRLGDYKLAA EEFAQLSAKF
PQHPLQAEAM YYEGESWLLA KETEKALKVF QQAFDRFTPK DAASVKEELK APVLFGYRSG
LMMARTLEQT GRLEQADQAY ETLLKKFPKA EVFDQLLNEW ALINYEAGRF EQADKIFARL
VAECPESPLA DNAKLSLAES DLIQGEFARA RKSLEELLES EQSDVSVKER ALYQLVVLAI
EQQRWDDVRK LGGQFVATYP ESPQKLQVAV ALAESQLATD LSVETATALL PQLDTLRETI
RAARKQGEIE DWHGRPWVLW AEARLRHRNY EGLTDAAEEL DGWQPPGPAR WQIREVLGRA
YKQQAKFDEA RAAFQIVTKD PQSQQTELAA KAQFLLAETY FLQENWKQAF LEYQKVYSNY
AFPEWQAAGL LQAAKCDEQR SQWKEAIATY ERLLREFPNV SYATEAKERL EAARKRN