Gene Plim_0517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_0517 
Symbol 
ID9137194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp642920 
End bp646111 
Gene Length3192 bp 
Protein Length1063 aa 
Translation table11 
GC content52% 
IMG OID 
Productprotein of unknown function DUF1549 
Protein accessionYP_003628564 
Protein GI296120786 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACCA TCTCTGTCTG GCTCCTGGTT GTCGCGATGA GTCCGAGTCT TTTGGCTGAG 
GAACTCTCGT TCAATCGTGA CATTCGTCCG ATCCTTTCTG AGAATTGTTT CCAGTGCCAT
GGGCAGGATC CGCAGCATCG AGAAGCTGAT TTACGACTGG ATGTGCGTGA AGAAGCGATT
AAAGACCTGG GAGGCTACCG AGCGATTGTG CCGGGAAAGC CGGAATCCAG TGAATTGGTG
GCTCGGCTGA TCTCCCATGA TCCCGACAAG AAAATGCCTC CGCCAGCTTC GAATCGGCTG
GTGACCAAAG AGCAGATAGA AACCATCAGG CGTTGGATTG AAGAGGGGGC CAGTTACCAG
ATCCACTGGG CTTACCTGCC ACCGCAGAAA TTGAAGCTGC CTGAGGTGCA AAATCCATCC
TGGCCTCGCC AGCCGTTTGA TCGATTGATT TTGTCTGGCC TGGAAAAACA GGGGATCGTT
CCATCTGCCG AATCGTCACC CGAAGTTTGG CTGAGGCGGG TGAGTTTTGA CTTAATTGGC
TTGCCCCCCA CTCCAGACGA GATCCGCACA TTTGTAAATG ATGTGGCTCA GCGACAGGAA
GCGGCTTACG AGCATGCCGT TGATCGTCTG CTGCAATCGC CTCATTTCGG GGAACGCATG
GCGATGGAGT GGATGGATAT CGCACGCTAC GCCGATTCCC ACGGTTTCAA TAACGATGGC
CTGCGGAGCA TGTGGCGATG GCGTGACTGG GTTATCGACG CCTTCAATGA CAATATGCCT
TATGACCGGT TTGTACTCGA ACAACTGGCC GGCGATCTGC TCCCTGCACC CACACTTGAG
CAACAGATTG CCACGGGTTT CTCAAGAAAC CATGTGATCA ACAGTGAAGG GGGCATCATC
GACGAAGAGT ATCGTGTCGA ATATGTCGCT GATCGAGTGC GTACTATGAG TACTGCCTGG
CTCGGTTTGA CGACCGAGTG TGCCCGTTGT CATGATCATA AGTTCGATCC GATCTCTCAA
CGGGATTACT ATCGTTTGTT CGCCTTCTTT AACAATGTCG CAGAGCATGG CGAGGACGGG
CGAACGGCTA ATGCTGTGCC GATGATTCCT GCTCCCACTC GCGAGCAGCA AAGAGTACTG
GCAGATCAGA GAGATCGATT GCGATTGCTG GATGCTGAGA TTGCCGATTT GCAGAAATCG
CCGGAGCTCT TGATAGACTC AAGTGATCTC TTCGAGTCCA AAGCGACCTC TAAAGAAACC
AATGATGATT GGCAATGGTT GCTGGAATCT GGCGAGGCGA GAGATCGTCA GGTGGGAAAT
GGAATTTCTT TCAACGCCAG TCAGCCGCTG ATTCAAATTG CGGCAAAAAA CCTCCCCTTC
AGCAAACATA AGCAGACCAT TCTCTCGCTG TGGATTAAGC CGAATTCAGA TAATTCAGAC
GAGGTGGCCA TCCTTTCCTC GATCGATTAC GGCGGCTCCC CCGCTGATAC TCAATATGGA
AAAGGGCGTG AACTGCGATT GGTGGATGGA GAGCTGGAAT GGCGTGAGAG CAGTCGCCTC
CCAGTCTATT CACGAATCGT CATTACAGAA GGGGCGTCAG TCAGCCCGGA ACAGTGGAGC
CAGATTGTTG TCATCGTGGG GGGAGACAAC AACGCTGCTG CGGTTCGGTT TTTTGTCAAT
GGGCAGGAAG TCGCTACGCA CAGTTTATAT GACGGTCTGA TCAATGAAGC TCCTGATAAA
GATGTGTTGA TCGGTAGAGA CAATGCCAAA GACAGTGTTC GATTTCTAGG TCAGATCGAT
GAGTTGCGCT GGAGCCAACG CTTACCGACG AGCGAAGAGA TTCGCGACGA GTTTCTCAAT
GAAGCGATCC CCTATGCACG ATCTCATCCG AAGTGCGAAA CCTCTCAATC GTGGTTAATG
ACGGCAAGAT CATTAATTCA CGCGAAGTTA AGAACGCTGA TTGATCAGCG GTGCAGGCTA
TGGGAGGAGC ATCTCGCATT ACGCCGGGAG TTGCCAACCA CTATGGTTAT GAAAGAACTC
GGCGAAGCCC AGGATATGCA TGCTCCCCTG GATCAATATA GCGATCGATA CAGCAATCGA
TACCGCAAAA GATATCTTTC TCAGGGACCG CGCAAGACCT ATGTGCTCAC CCGGGGCAAT
TATGACGCAC CTGGGGAAGA AGTTGACCCG GGGGCACTGG AGACTTTACT GGCTCCCTGG
CCAATCGATG CGCCGCGAAA TCGTCTTGGT CTGGCACGCT GGTTGACCCA GCCGCAACAC
CCTTTAACTT CACGAGTCGT TGTGAATCGA TTTTGGGCTC AACTGTTTGG AGTTGGCCTT
GTGAAGACGG TCGAGGATTT CGGGTCACAG AGTGAGTGGC CCAGTCATCC TGAACTCCTC
GATTGGCTGG CCCGTGATTT TGTCGATTCA GGATGGAACG TCAAATCGCT CATGAAATCG
CTGGTGCTTT CAGCCTCTTA TCGCCAGAGC AGTGAAACGA CACCGACAGC GGTGGCGAGG
GATCCTGAGA ATCGGCGAAT TGGACGCGGG CCACGTGTTC GATTACCTGC TGAGTTAATC
CGAGATCAAG CATTAGCCAT ATCCGGGTTA CTCAAACCTC AAATCGGCGG CCCCAGTGTG
TATCCTTATC AGCCAGATAA GCTCTATGAC GGTATTGTGG TTGGTACTGA ATATCCGGGC
TCCAGGTGGC AATTGAGTCA AGGAGATAAT CTCTATCGAC GAAGCTTATA TACGTTCTGG
AAACGAACTG TCACCCATCC CGCCATGCTG ACGTTTGATG CACCAGATCG TGAAGTCTGT
ACAGCCCGTC GCTCGCGAAC AAATACGCCT TTGCAAGCTC TCTTGCTGTG GAATGAAACC
GGTTATCTGG AGGCCTCCCG GAAGCTTGGT GCCCGCATGA TCAAAGAAGG TGGTGACGAA
GATTCCTCTC GAGTCTCCTT CGCCTTTCAA TTGGCAACAG GTCGTTTGCC CGTTGCGGCT
GAGTCAGAGA TTCTGGTCAA GACGCTGAAG CGGTTGCGTA ATGATTTTGA ATGTCGTCCG
GCAGATGCGG CAGCGTTTAT TCAAATGGGT GCCTCACCGG TCGATCAATC CATCGCACCG
ACGGATTTAG CGGCAGCTAT GGCCGTCGCC AATATGATCT TAAATCTCGA TGAAACCATT
ACCAAGAATT GA
 
Protein sequence
MKTISVWLLV VAMSPSLLAE ELSFNRDIRP ILSENCFQCH GQDPQHREAD LRLDVREEAI 
KDLGGYRAIV PGKPESSELV ARLISHDPDK KMPPPASNRL VTKEQIETIR RWIEEGASYQ
IHWAYLPPQK LKLPEVQNPS WPRQPFDRLI LSGLEKQGIV PSAESSPEVW LRRVSFDLIG
LPPTPDEIRT FVNDVAQRQE AAYEHAVDRL LQSPHFGERM AMEWMDIARY ADSHGFNNDG
LRSMWRWRDW VIDAFNDNMP YDRFVLEQLA GDLLPAPTLE QQIATGFSRN HVINSEGGII
DEEYRVEYVA DRVRTMSTAW LGLTTECARC HDHKFDPISQ RDYYRLFAFF NNVAEHGEDG
RTANAVPMIP APTREQQRVL ADQRDRLRLL DAEIADLQKS PELLIDSSDL FESKATSKET
NDDWQWLLES GEARDRQVGN GISFNASQPL IQIAAKNLPF SKHKQTILSL WIKPNSDNSD
EVAILSSIDY GGSPADTQYG KGRELRLVDG ELEWRESSRL PVYSRIVITE GASVSPEQWS
QIVVIVGGDN NAAAVRFFVN GQEVATHSLY DGLINEAPDK DVLIGRDNAK DSVRFLGQID
ELRWSQRLPT SEEIRDEFLN EAIPYARSHP KCETSQSWLM TARSLIHAKL RTLIDQRCRL
WEEHLALRRE LPTTMVMKEL GEAQDMHAPL DQYSDRYSNR YRKRYLSQGP RKTYVLTRGN
YDAPGEEVDP GALETLLAPW PIDAPRNRLG LARWLTQPQH PLTSRVVVNR FWAQLFGVGL
VKTVEDFGSQ SEWPSHPELL DWLARDFVDS GWNVKSLMKS LVLSASYRQS SETTPTAVAR
DPENRRIGRG PRVRLPAELI RDQALAISGL LKPQIGGPSV YPYQPDKLYD GIVVGTEYPG
SRWQLSQGDN LYRRSLYTFW KRTVTHPAML TFDAPDREVC TARRSRTNTP LQALLLWNET
GYLEASRKLG ARMIKEGGDE DSSRVSFAFQ LATGRLPVAA ESEILVKTLK RLRNDFECRP
ADAAAFIQMG ASPVDQSIAP TDLAAAMAVA NMILNLDETI TKN