Gene Plim_1040 details

Gene Information

Locus tag: Plim_1040
Symbol:
ID: 9137726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 1312724
End bp: 1316092
Gene Length: 3369 bp
Protein Length: 1122 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: glycosyl transferase family 2
Protein accession: YP_003629081
Protein GI: 296121303
COG category:
COG ID:
TIGRFAM ID:
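
The start, end, and length fields above can be cross-checked with simple arithmetic. The Python sketch below assumes 1-based inclusive coordinates (the convention under which End bp - Start bp + 1 equals the listed gene length) and assumes the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with one uncounted stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1312724, 1316092)
print(length_bp)                  # 3369
print(protein_length(length_bp))  # 1122
```

Both results match the Gene Length and Protein Length fields listed in this record.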


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGTACG CCCGCGCGTT CGTCATCGTT TCCGACGATC GATACTTCCC CGGACTTCAA 
GCCGCACTGG GAAGTATCCA CGCCTACTAC GGACAAGAGA TCCGTGTCTT CGTCGTCGGT
CATGGACTGA CGGCACCGCA GGTGGAGTCA CTGAAGAACC ACCCGCTGGG ATCCGCCATT
ACCTTGCTCC GCACCCAGGA CTTCGCCTCC CGGCCCAGCG GATGTTGGGA AGCCAAGCAG
CTCTGCCTCA GCGAGCTGGT TGCGTCCGTG CGCACCGTCT GTCTGATGGA CGCCGATCTG
ATCCTTCTCT CGCGGGTGGA CGACATCTTC GAACTGGCGG AGCAGGGAAA AATCATCTCC
TCGCGCGACG GAGAGGGGAT GCAGTTCGGG CCCGAATACC AGGTTTACTC ACCCGCTCTG
GTGGGGCCGC GGCAGGACTA CATCAACTCG GGATTTCTCA TATTCGATCT TCGCCAGCAC
TGGGATCTCG TCGCCCTGTG GTCGTTCACG GCTCGCTTCG CCGGTTACTC CCCCGGCAAG
GGTTGGCCCT TCGCATTTCC GGGACATGGG GATCAGGGGG TCTTCAACGC ACTGCTGGCC
CTCCAACAAA AGTCTGATTT TCTGCATGCT CTTCCGGAAA ACACTTGGTG CAACTCTGCT
GGATGGAAAG AGGGGCGAAA GGTCCGCGTT ACACATCAGG AGGGGAACCG CTTGAGTGTG
ATTCATGAAC CTGGCGGAGA ACAGCAGCGA GTTCTGCACA GTTCTGGCCC TAAGTGGTGG
ACAGACGAAG GTCGCACTTT CTTCGCATCT GCTGGTGATG TCCTGCGATG TTTTGATGCT
ATGCGAAAGA TCACCGAGCC AACGATGGGC TCAAACAAAA ACCTTGGAAT CAGTTCTCCC
GAGGAGAGGT CAAAACCTGC GTTGAGTGTT TGTGTGGCGA TCAAAAATCG CTCGAAAGTA
AGTGTCGGGC TAGAAAGTCG GGAATTATTT CCCAACTGTA TCCGCAGTCT GCTCAGTGCG
TCGGCTGGCG GCTTGAAAAT TGAACTCGTC GTGGTGGATT TTCAGTCGGA CGATTGGCCA
CTCGAAGAGT GGTTCTCAAA TTTGGTGGGT GACGTTGCTC ATCAGTTAAT CACTTTGCGG
GAACCTTTTT CACGGGGGCG GGGACTCAAT GTTGCAGCAT CTGCTGCACG ATCTGATCGA
TTGCTGTTTC TCGATGCCGA CATGCTTGTT GACAATTCTG TGCTTGAAAG AGCCTACGAG
GTCGTCAATT CTGGTCGAGT CTGGCTTCCC ATCTGCGAAT GTGTGAAGTC CGATGGAACC
TTCGACAGTT GGGGTTTTTT GGGGAACGTG GCCTGTGATC GGGAAGTTTG GAATGCAGCA
GGTCAGATTC CCGAGTTTTA CAGTTGGGGC GGTGAAGACA ATCTGTTTGC AGATCGCTTG
CAATCCATCA AGCCGGGCAT TCGCGAAGTC AGTCTCGGTT TCCGACATCA ATGGCATCCC
GAGTCGTTGC GACATGCTTA CTACATTCGC CCTGCTCAGA GCGACTTCTT TCGAATAGAT
CAACAGGCTA CAAGCCCAGT CCAGCAATTG TGGCGACTCT CAGCACATCA CCCGGAATGG
CAGGGAGAGA TACAGTTCTT CACAACTGGA CGAATGGGGC GTCCGGGAGG AATGTTTGGT
AGTTTCCAGG CCTCTGCCGA ACAACTGACC TTGCACTGGG ATTCGGGACA TTCTGATACA
TTTCGGTGGG ATTCCACTGT CCAAAGATAT GTGCATGATC TGCTGGATTT TTGGTTGGAA
CCTTTCACTT CTCTGGAATC CATGGATATA GAACGTCTTT TGAACCGCCA AGTTGAAGCA
TTATCCCTAG AAGAAACTCC TAAGTCGTTT AAGCACCTCA CAATACCACT TCCGGATCTA
CTCTCGCGCC GCGATCTTTT GCCTCGAATT CTGGTCTTTA GTTCTGTTGG TGATCGGGGA
ATTGCTGTTA ACTCGTGGCT GGGGATCTCT TCTGCACGAC AGGATCGACC GTTTGATACA
GCCATCGTCT ATTATGGCAA CTATCCGACC GGTGACTGGG CACAGGGACT CCGCGCACGC
TCCTGCTATT TTGGGCGTCA TCGTGGGGGT AAAATTCAAA ATCTCTCTTG GCTAATAAAG
CAGCATCCCA CTTTGCTTGA AGATTATGAC TACGTATTCG TTCTTGACGA CGACTTACGC
ATTACGCCGG ATATGATTGG GCGCGTGGTG CGAACGGCCC GGGAATTCGA TCTCCCCATC
TGTTCACCTA GCCACGACTG GAGAGGAAGG ATTTCCTGGC TCCACATGGG AGCCCGCGGA
AGTGGTCGCC GGGCTGCTGA ACCTCCTGCT GGAGTGGAGT TAACCAATTT CGTCGAAGTG
ACTTGCCCGC TCTTTGAGAC AGATGCTCTT CGCGAATATC TGAATCACTT CCTCTGGTGC
GGTTCTGAAT TAACCGGTTG GGGTGATGAT TGGTTGATGA CGGCGGCCTG TTTCCAAGAG
AGTCGTCCCT TTGGCGTGCT TCACAACATC ACAATTTGTA ATCCCAAAGA TCGGCCAGGA
CCCAACCCCC AGCTTCGTGA AATTGATCAA CTGCATGCGG TGGAAGAGAG GCTGACAGCC
TGGAAGCGAG TCAAGCAACG ATTGGGTGGC CGCCTTCCCG TCACAGAAGA AGTTCGCACG
TGGCTACCTC GCCCACGCCT CCGCTGTGTC AATCTTCCAC GAGCCACTGA GCGGAGAAAG
AAGATCACTT GTGAATGGAT CGATGGATTA GGATTCCCCA TTAAGTTCTT TCCGGCATTT
GATCGGCGGG AATTGGAAAA AGGCCGAAGT TTTTTCCAAT ACGAGGATTC AGCTGCCATC
AACAAGATCG GACGCCCACT TACAGCAGGA GAAATTGCTT GTGCAAGTTC ACATGCTCTG
GTAATACGTG AAGAAATGGA GTTCACGGGT CCTGAAGGAG TCATAATCCT CGAAGACGAT
GTGACGCCTC GATGGGGTGC TGTCGATCTT TTCGAGCGAT TGAAAACTGC TGCCGCTGCC
TTACCTGGTG TCGAAGCCAT CGCCTGTCAT GAAGCTTTGG ATTCTTGGGA ACGAGGTGAA
AGCTGCGGAG AAGCCGTTCG AGCCCTGAGT CCACCCTGGG GAACGCATAT CACTTGGTAT
AGCCACGCTG GCCTCTGTCG AGCCTATGAG TCGCTCATCA AGTTCGATCA ACCTGCGGAC
TGGATCTGGC GAGAGTTCTG CTCGCGGGGA GCTTACGCTC TCCTCGATCC GCCAATCGCC
CAACATCGGG GCGACTCGAC TTATGTGGGA AATGATTTTC GCGGAATGGT GCGACCCTAT
CGGGAGTAG
 
Protein sequence
MEYARAFVIV SDDRYFPGLQ AALGSIHAYY GQEIRVFVVG HGLTAPQVES LKNHPLGSAI 
TLLRTQDFAS RPSGCWEAKQ LCLSELVASV RTVCLMDADL ILLSRVDDIF ELAEQGKIIS
SRDGEGMQFG PEYQVYSPAL VGPRQDYINS GFLIFDLRQH WDLVALWSFT ARFAGYSPGK
GWPFAFPGHG DQGVFNALLA LQQKSDFLHA LPENTWCNSA GWKEGRKVRV THQEGNRLSV
IHEPGGEQQR VLHSSGPKWW TDEGRTFFAS AGDVLRCFDA MRKITEPTMG SNKNLGISSP
EERSKPALSV CVAIKNRSKV SVGLESRELF PNCIRSLLSA SAGGLKIELV VVDFQSDDWP
LEEWFSNLVG DVAHQLITLR EPFSRGRGLN VAASAARSDR LLFLDADMLV DNSVLERAYE
VVNSGRVWLP ICECVKSDGT FDSWGFLGNV ACDREVWNAA GQIPEFYSWG GEDNLFADRL
QSIKPGIREV SLGFRHQWHP ESLRHAYYIR PAQSDFFRID QQATSPVQQL WRLSAHHPEW
QGEIQFFTTG RMGRPGGMFG SFQASAEQLT LHWDSGHSDT FRWDSTVQRY VHDLLDFWLE
PFTSLESMDI ERLLNRQVEA LSLEETPKSF KHLTIPLPDL LSRRDLLPRI LVFSSVGDRG
IAVNSWLGIS SARQDRPFDT AIVYYGNYPT GDWAQGLRAR SCYFGRHRGG KIQNLSWLIK
QHPTLLEDYD YVFVLDDDLR ITPDMIGRVV RTAREFDLPI CSPSHDWRGR ISWLHMGARG
SGRRAAEPPA GVELTNFVEV TCPLFETDAL REYLNHFLWC GSELTGWGDD WLMTAACFQE
SRPFGVLHNI TICNPKDRPG PNPQLREIDQ LHAVEERLTA WKRVKQRLGG RLPVTEEVRT
WLPRPRLRCV NLPRATERRK KITCEWIDGL GFPIKFFPAF DRRELEKGRS FFQYEDSAAI
NKIGRPLTAG EIACASSHAL VIREEMEFTG PEGVIILEDD VTPRWGAVDL FERLKTAAAA
LPGVEAIACH EALDSWERGE SCGEAVRALS PPWGTHITWY SHAGLCRAYE SLIKFDQPAD
WIWREFCSRG AYALLDPPIA QHRGDSTYVG NDFRGMVRPY RE
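
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before use in most tools. The Python sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first line of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied as printed in this record.
first_line = "ATGGAGTACG CCCGCGCGTT CGTCATCGTT TCCGACGATC GATACTTCCC CGGACTTCAA"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Applied to the full gene sequence, the cleaned string should have the listed length of 3369 bp, and the GC fraction can be compared against the GC content field of this record.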