Gene Plim_4208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4208 
Symbol 
ID9140929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5379088 
End bp5382093 
Gene Length3006 bp 
Protein Length1001 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003632215 
Protein GI296124437 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0104266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGTTG TCTTGGATGT CGGGTTGATC TCCGCGAAGT TTTTCTTCTG TCTTCTACTG 
GCTCTTTTCT GTGTGGGAGG AGTTGCCAGT CGGAATCTGT GCGCGGCCGA TGAACCATCC
CCGACTGTCA CATGGCGCGA GGCCTACGAG CATCTTCAGA AAGGTCGCTA CGAAGAGGTC
GAAGAAGCGT ATGAGTTCCT GAAAAAGCAG TCGGCCACAA AGGATGATCT CCCCGGCGAA
TTCGCCAATA GCCCACATCC ACTCACAGGG CCACAGTATG CCGAGATCGC GCTGGCGCGC
ATTGATCTCG AAACCGGACA TCGAGCCCGG GCCTACGAGC GGTTGGAAGC ACTCAAGAAA
TTGCATTCCG AACAACCGCG GATTCTCGGG TTGCTCGCGT GGTATGCCTT TTTGAATGGC
AAACTGGATG TCGCAGAAAC AAGCGCACAA GAAGCCATTC GGATAGAAGC GGATGAACTC
TGGGCACGTC GAACGCTGGC AGAAGTTTAT CAGGCGACCG GTCGATTAAA GCAGGCGGAC
GAAGGCTGGC GGTATTTCGT GAGGTATTAC AATCGCGTTC AGCCAGAAAA AGCGGAAGAG
CTGTTGCTTG CTGCTGAAGG TTCGTTGGCG TATGCCCGCT GGCATACCGG GAAACAGATT
TTTGATTTCG TGCTGAATAC GCTGTGCATC GATGCTCTCA AAGCCGATCC ACTTTTCTGG
CAGGCTCATG AATTGAGTGG CCGGCTCTTG CTGGAGAAGT ACAACAAGCC GCAGGCTCAA
CAGGAGTTCC AGGCTGCTCT CGCGATTAAC CCGCGGGCTG TGACTGTGTT GCTGGGCAGG
GCACAAGCTG CGGCCCAGGA TTACGACTGG GATGAATCGA ATCGACTGGC CAAAGAAGTG
CTGAAAAATG CCCCTGGCGA ACCCCTGGCA CATACGCTTC TGGCAAGATC CTTGCTGTTC
TCGAATCAGC CCGAGGCAGC TCTCGAACAG CTGCAACTGG CAAAAGCGAT CTGTCCGACC
GATCCCACCA CGACGGGCCT GATTGTCGCG GCCCAGATTC AACGTGACGG CATTCGTTCA
TTGCCCCGCC TGCAGCTACT TCTCGAATCA ATTGATCATA TTGCTGATCT TCCGGCAGCG
GCAGATGCTC CCGAGGCCTA CGAGACCACG TTGATTGCCG CAGCGAAGAT CAACAGTGCC
TGCGGTGAAG TTTTAGCCAC AACGGGCGAA GCTCTCGAAA TGCATCGCAA GTTTGAACTG
GCCGAAAAGT TTTATCGGGC GGCACTCGCC GTCATGCCTC AACTCACTGC TGCCAGGAAT
AACCTGGGGA TGCTGACGCT GCAGATGGCG CGGGTGGATG AAGCCCGCAC CATGCTCGAT
CAGGCATTCA AGAGCGATCC GTACCACATG CGTGTGAGCA ATATGCGTAA GGTCATTCGT
CAACTGGATG GCTACGCGAC ACTTTCGACT GATCACTTTG TAATTCGCTA TGACAACGCG
CAGGACGAGT TGCTCGCACG CTATATGTCG AAGTTTCTGG AAAACGAAGT CTATCCGCAA
CTGGTGAAAC AGTTCGGATA CGAACCATCC ACACGAACCA CCATCGAGAT TTACAGCAAA
GGGAATGGTC AGACGGCTCA TGAATGGTTC AGTGCCCGGA TGGTCGGTTT GCCCTGGGTA
CAGACGATTG GTGCCTCCAC GGGGATGATG ATTGCCCTGG CTTCACCGAA CGGGCTGAAT
GAACCCTATA ACTGGTCACG CGTCATTCGG CATGAGTATG TGCATGTGCT GACATTGCAG
CAGACACAGT TCAACATCCC CCACTGGTAT ACCGAGGCTC TGGCAGTGAG GAACGAAGGT
TATCCTCGTC CAGCCGAATG GAATGACATG CTTCGCAAGC GAGTTCCCCG GCGGGATCTG
CGGAATCTTT CCAATCTGAA CCTGGGGTTC ATCAGTGCCA AAAATGGCGA TGACTGGAAT
TTTGCGTATT GCCAGAGCGA TCTCTATGCG AACTATCTCG TCGAGCGGTT TGGTGAGCCT
GCACTGGAGA AACTTTTGTT GGCCTATCGA GCCGGCAAAA CGACTGAGGT GGCTCTGAAG
GAACTGTTTC AGACCGACAT CAAAGACTTT GAAGCGGGAT ACCTGGATTA CCTCGACAGG
ATCGTGGCAC AGCTTCCGGG GCAGGCTGAG GCAAGTATTG AATATTCAAA AAGTGAGGTG
GAGGCGGCTT TAGAGAAAGA ACCTCGGAAT GCCGAGTGGC TGGGCCGGGC CGCGATGTTG
AAAGCCAAGG ATCGTCGCCG GGATGAAGCC CGTAAGCTGG CTCGCGAAGC TTTGGAGATC
GATCCACATT GTGCGACAGC CGCCATCGCC CTGGCAGAAC TGGATCTTCG CGGAGAAGCA
CCCGAGAAAG CCATTCAGAA GCTCGCGTCG GCTCTGGATA ACACTCAGCC AGATCATCAG
CTCCTGCAGC GACTTCTCCC ATTATTGATT CAAACCAAGC GATGGGACGA AGCCCTCAAG
TATGCGTCTC TGGCGGAGAC AACGTTTCCG AGTGACATTT CATCAACGGT AGCGCTGGCT
GAAATTCTGC CACATTTCAT GGATCTGCCC AGATATAAAG CTGTTCTCGA AAAGCTGTCT
ATGTATGAGA ATGATGAATC TGAGTACCGG CTCCAGCGAG CAGAAATTGC CTGGAAAGAG
AGCGATTTGC CGAACGTGGC CCGGTATGCC GCCATGGTGC TGGAAATCGA TGTGACAGAA
CCGAAAGCTC ATTGGCTGCT GGCCGAAGGG TTATCTGGCG TCAAGCCGGA GGAAGCTCTC
GAAGAGTTTT CGATCGCCTG CAAACTCGAT GATGAACTGG CCGAAGCGTG GGCCGGCTGG
GCGGTTCTGC TGCAAACGCG CGGCCAGGTA CAAGAAGCCC GCCAAAAGGC CGAAACCGCC
CTGGCACTTG ATGCCAAGAA TGCACGGGCT CTCGAAGTCC TCAACAAGTC GCAGCCTGCT
AAGTGA
 
Protein sequence
MRVVLDVGLI SAKFFFCLLL ALFCVGGVAS RNLCAADEPS PTVTWREAYE HLQKGRYEEV 
EEAYEFLKKQ SATKDDLPGE FANSPHPLTG PQYAEIALAR IDLETGHRAR AYERLEALKK
LHSEQPRILG LLAWYAFLNG KLDVAETSAQ EAIRIEADEL WARRTLAEVY QATGRLKQAD
EGWRYFVRYY NRVQPEKAEE LLLAAEGSLA YARWHTGKQI FDFVLNTLCI DALKADPLFW
QAHELSGRLL LEKYNKPQAQ QEFQAALAIN PRAVTVLLGR AQAAAQDYDW DESNRLAKEV
LKNAPGEPLA HTLLARSLLF SNQPEAALEQ LQLAKAICPT DPTTTGLIVA AQIQRDGIRS
LPRLQLLLES IDHIADLPAA ADAPEAYETT LIAAAKINSA CGEVLATTGE ALEMHRKFEL
AEKFYRAALA VMPQLTAARN NLGMLTLQMA RVDEARTMLD QAFKSDPYHM RVSNMRKVIR
QLDGYATLST DHFVIRYDNA QDELLARYMS KFLENEVYPQ LVKQFGYEPS TRTTIEIYSK
GNGQTAHEWF SARMVGLPWV QTIGASTGMM IALASPNGLN EPYNWSRVIR HEYVHVLTLQ
QTQFNIPHWY TEALAVRNEG YPRPAEWNDM LRKRVPRRDL RNLSNLNLGF ISAKNGDDWN
FAYCQSDLYA NYLVERFGEP ALEKLLLAYR AGKTTEVALK ELFQTDIKDF EAGYLDYLDR
IVAQLPGQAE ASIEYSKSEV EAALEKEPRN AEWLGRAAML KAKDRRRDEA RKLAREALEI
DPHCATAAIA LAELDLRGEA PEKAIQKLAS ALDNTQPDHQ LLQRLLPLLI QTKRWDEALK
YASLAETTFP SDISSTVALA EILPHFMDLP RYKAVLEKLS MYENDESEYR LQRAEIAWKE
SDLPNVARYA AMVLEIDVTE PKAHWLLAEG LSGVKPEEAL EEFSIACKLD DELAEAWAGW
AVLLQTRGQV QEARQKAETA LALDAKNARA LEVLNKSQPA K