Gene Plim_4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4003 
Symbol 
ID9140723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5138069 
End bp5141140 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content55% 
IMG OID 
Productprotein of unknown function DUF1355 
Protein accessionYP_003632013 
Protein GI296124235 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0902905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGCAAC AAGCACAGTA CCTGCTGATG CGACTTTTTC CACCGGCTCG CAAGCCAGTG 
ACGCCCTGGA CTGTTGCGCC ATTACTCATC TCGATGGTCT TGACCGTGGG CATTGCCTTC
TGGCTGGAGG CCAGTCGGGT GTTACTGTTG ACCCGGCCAT GGTTACTCTC GCTGGTGGTC
TTGAGTGTGT GGGTCTGGTG GCTGCACATG GCGGGGATGC CGGGCTTACC GTGGCTGCGA
TCCTGGATGG CACTGTGGGT TCGTCTGGCG ATGGTCGGAA TCCTCGCATT TCTGCTGGCT
GAACCCCGCG CTGTTCGTGA GAACGACCGA CAATCACTGA TGTATGTGCT GGATACGTCC
GATTCGATCG GCCGCTCAGC CAAGGATCAG GTGCTGCGCT ATATTGCAGA AACTGTCACG
AAAAAACCGG CTCGCGATGA AGCCGGGCTG TCAGTCTTCG GGCGTAACGC GGCTGTGGAA
TTACCACCTC GTACCACGTT TCTGGCCGAG GCACTCAATA CCGATATTCG TGGTGATGCC
ACCAATATCG AGCAGGCACT TTCCCTTTCC AGCGCCATGC TGCCGGATGA TCAGGCGGGG
AAGATTGTGC TGTTTTCCGA TGGATCTCAG ACTGAAGGGA GTCTCGACCG TATTCTCGAT
GAACTGAAAT CCCGTAAGAT CTCTGTCGAT GTGGTGCCTA TTGAATATGA CTACGAACAT
GAGGTCTGGG TCGAGCGAAT TGATCTTCCG AGCAACGTCA AGATTGGCGA GACCTATGAA
GCGGCGGTGA TCGTCTCGGC CTTGTCGGCT GGTCAGGGCA AGCTGGTTGT TCGCGAGAAC
GGCCAGCCGA TTGTGGAAGA AACGATTTCG TATCGCGAAG GGAAGACACG CCTGGCAGTC
CCGCTCGCCT TACGCCGGCC CGGATACTAT GAATATACCG CCACCATTGA ACCCGAGGCA
GAAGCTGACA GCCTGGCGCA AAACAATATG GCCATGGGTG GGATTGTCGT CGAAGGGGAG
GGGAAAATCC TCGTTGTTTA CGATCCCACG GGGAATCCAC TCGATTGGGA ACCACTGGTC
GAGTCTCTCA ATAAGGCCAA AAAACAAGTT GATGTGATGG CCGGAGTCGA CTTTCCGCGA
GACCCTTCCT CACTGATTCC TTACGACTCG ATTTTGTTTG TGAATGTCCC TGCCAATGAG
TTCGATGGCG TTCAATTGCA GGCACTCAAA GACAGTGTTT TCGATCTGGG AACTGGCTTT
CTGATGGTCG GTGGGCCGGG GAGCTTTGGC CCCGGGGGAT ATCACCGGAC GGCTGTCGAA
GAGATTCTTC CCGTCACGAT GGATATCACA CAGAAGAAGG TGCTTCCTAA GGGAGCACTG
GCCATCATTC TGCATACCTG TGAATTCCCG GAGGGCAATA CCTGGGGCAA GCGAATCACC
AAGCAGGCCA TTAAGGTTCT GGGCGAACAG GATGAAGTGG GCGTTCTGGC CTATGACTAC
AACGATGGTG AGAAATGGAT TTTTGAACTC ACACCCGCAG GAAAGTACGA AGAGCTGTCG
TTACTGATTA ACTCAGCTGA GATTGGGGAT ATGCCCAGTT TTCAGCAGAC GATGCAGATG
GGTATCGATG GACTCGAAGC GAGCGATGCT TCGTCGAAAC ATATGATCAT CATTTCCGAT
GGAGATCCTT CACCAGCCTC GCCCGATCTC TTGAAGCGAT TTATTGACGC GAAGGTGACC
ATCAGCACGG TCGCTGTCTT TCCACACGGA GATGTGGATA CGCCGACGAT GACATCGATC
GCACAGATTA CTGGCGGGCG TTATTACAAG CCGACCAATC CGAATCAGCT ACCAGCGATC
TTCATCAAAG AATCGAAGAC ACTCCGCCGG TCGATGCTTC AAAACCGCGA TTTCTTCCCG
GAAGTTGCTT CGAGTTCGCC AGTTTTGAAA GGCATCAGTT CATTACCGGA GTTGAAAGGG
TATGTACTCA CGACCGCCAA GCCCGATGCT CAGGTTGTGC TCAAAGTTCC GCCCGGTTCG
AAAGAGGAAG AGTCGCAGCT GGATCCACTT CTGGCGATTC GCCAGCACGG GTTAGGGAAG
ACGGCGGCTT TCACTTCGGA ACTTGGCAAG AACTGGGGAA AGGACTGGGT GGCATGGGGC
AAGTATGAGG ATTTCCTCAA TCAGCTCACC ACGGATATCG CCCGCATCCG CAAAGACACA
CAACTCCGCT TGAGCACGTA TGTCGAAGGA GCGCAGGGAG TCGTTATTGT CGAAGATTTT
GCCCCGGAAG AGGGCTTTCT GGAAATCTCC GGACGCGTCG GTGGCCCGAA CGATCGTTCG
GAAAGCCTCA CTTTCCGGCA GGTGGGGCCG CGTCGCTATC AGGCGCTGGT TCCGCTGTGG
GGGCAAGGCC GCTACTACGT TTCGGTGGCA GGTGCGGGAA CAAAAATCGG CGTGGACGGT
CAACCGGCTG AGCGGAAGGA ATCGACGTTT GGCGGATTCG TGCTGGCCTA CTCGCCGGAA
TATCTGCGGT TTGGATCGAA CCGGCAGTTA CTCGAAGAGA TTGCCCAAAG GACAGGTGGG
CGCGTCTTGA CGGGTGATCC AGAAAGTGAC GAACTCTTCC CGAAAGAGCG CGAACCCCGC
CAGAGTTCAC GTCCGATTTT TGACTGGTTT CTTGTGGCGC TGGCCTGTCT TGTCCCTCTC
GATGTCGGTT TGAGGCGCAT TCAGTGGGAT TGGTCTGTCG TGGCAGGCTG GTTCAGACCC
CGCCGGGAAG TCACCTCGAC AATGTCAACT TTGCTCGATC AGAAAAAGTC CGGTTCGCAG
CAGACAACCA CTGAGACCGG CAAACCTGCC GCTGAGGCAT CGTCATCGCG GAAAACACCA
CCACAACGGC CACCCGTCAT TCGCAAGCCA CCGATGACTC TACCGCCATC ACCATCTGCA
AAGACTCCCC CCACTTCAGA AAAAACGCAA ACCGAAAAGC CGGCACCAGG TGCTGCCAAA
TCGACTTATG AAAAACTGCT GGAGATCAAG CGACAACAGC AGAAGAAAGA CGAACCACCC
GCGAAAGATT AA
 
Protein sequence
MWQQAQYLLM RLFPPARKPV TPWTVAPLLI SMVLTVGIAF WLEASRVLLL TRPWLLSLVV 
LSVWVWWLHM AGMPGLPWLR SWMALWVRLA MVGILAFLLA EPRAVRENDR QSLMYVLDTS
DSIGRSAKDQ VLRYIAETVT KKPARDEAGL SVFGRNAAVE LPPRTTFLAE ALNTDIRGDA
TNIEQALSLS SAMLPDDQAG KIVLFSDGSQ TEGSLDRILD ELKSRKISVD VVPIEYDYEH
EVWVERIDLP SNVKIGETYE AAVIVSALSA GQGKLVVREN GQPIVEETIS YREGKTRLAV
PLALRRPGYY EYTATIEPEA EADSLAQNNM AMGGIVVEGE GKILVVYDPT GNPLDWEPLV
ESLNKAKKQV DVMAGVDFPR DPSSLIPYDS ILFVNVPANE FDGVQLQALK DSVFDLGTGF
LMVGGPGSFG PGGYHRTAVE EILPVTMDIT QKKVLPKGAL AIILHTCEFP EGNTWGKRIT
KQAIKVLGEQ DEVGVLAYDY NDGEKWIFEL TPAGKYEELS LLINSAEIGD MPSFQQTMQM
GIDGLEASDA SSKHMIIISD GDPSPASPDL LKRFIDAKVT ISTVAVFPHG DVDTPTMTSI
AQITGGRYYK PTNPNQLPAI FIKESKTLRR SMLQNRDFFP EVASSSPVLK GISSLPELKG
YVLTTAKPDA QVVLKVPPGS KEEESQLDPL LAIRQHGLGK TAAFTSELGK NWGKDWVAWG
KYEDFLNQLT TDIARIRKDT QLRLSTYVEG AQGVVIVEDF APEEGFLEIS GRVGGPNDRS
ESLTFRQVGP RRYQALVPLW GQGRYYVSVA GAGTKIGVDG QPAERKESTF GGFVLAYSPE
YLRFGSNRQL LEEIAQRTGG RVLTGDPESD ELFPKEREPR QSSRPIFDWF LVALACLVPL
DVGLRRIQWD WSVVAGWFRP RREVTSTMST LLDQKKSGSQ QTTTETGKPA AEASSSRKTP
PQRPPVIRKP PMTLPPSPSA KTPPTSEKTQ TEKPAPGAAK STYEKLLEIK RQQQKKDEPP
AKD