Gene Plim_3689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3689 
Symbol 
ID9140407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4749311 
End bp4751395 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content56% 
IMG OID 
ProductOligopeptidase A 
Protein accessionYP_003631700 
Protein GI296123922 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGAAC TTCTCGACAA CCCTCTCCTG GTAACCAGCG GGTTACCAGA TTTTGCCCGG 
ATCGAAGCCT CACATGTCGT CCCGGCTGTG CGTGCGACTG TCGAGACCGC TCTCAAAAAA
CTCGATGCCA TCGAATCCCA TCTGCAGCCA ACATGGGCTG GCATTATGTC CCCCATCGAA
GAGATGGAAC GCCCCTTCAC CTGGAGTTGG GGCCCCGTGG GTCATCTCCT GGGTGTTCGC
AACAGTCCCG AACTGCGAGC TGCCTATGAA GAGGTCAATC CTGAAGTCGT ACGCTATAGC
CTGCGCGTCA GGCAAAGCGA ACCGATCTAT AAAGCCCTGG TGACTCTGGC CGAGTCGCCG
GAGTGGGAGA GGCTTTCGCC GGCCCAGAAG CGGATCATTA AAGATCGCAT CAAGGATGCC
GAACTGGCCG GGATTGGTCT GCAAGGCGAC GCGCGAAAGC GATTTGGCGA GATTGAAGAA
AGGCTGGCTG TCCTTTCCAC ACAGTTCATG AACAACTGCC TCGATGAAAT CAAAGCTTTC
TCACTCGATC TTACCACGGA AGAGGAAATC GCCGGGTTCA CTCCCACACT CAGGCATCTG
ACGGCCCAAT CGTGGAATCG TGCTCACCCG GAAAGTGAAA CCAAAGCGAC CGCGGAGCAT
GGTCCCTGGC GCATCACGCT CGATTTCCCG GTGTATGGCC CTTTCATGGA GCACGCGAAA
AGGCGCGACT TACGCGAGAA GCTCTATCGG GCATTCATCA CTCTCGCTTC CCAGGGTGAA
CACAACAATG AACCCATCAT GCGGGAACTG CTGAGCTTGC GCAAAGAGAA GGCGCACCTG
CTGGGTAAGA ACTCGTTTGC GGAAGTCAGC CTGATGCGCA AGATGGCTCC CGGTGTGGAT
GCCATTCGCC ACATGCTGCA TGAACTTCGC GATACAAGCT GGGGAGCAGC ACAACAGGAT
CTCGCCGATC TGAAGGCGTT CAAAGTCTCC AGCGGCGATA CCGATGACAT CAAACCCTGG
GATGTTCCCT TTTGGGCCGA ACGGCTGCGC GAAAGCCGGT ATTCGTTCAC CGACGAACAG
ATTCGCCCCT ACTTCCCATT TGAACGTGTG CTTGAAGGAT TGTTCGGTCT GATTCATCGG
CTCTTTGGTG TCACGATTGA ACAGGCGCAA GAACCCGTCT CGGTCTGGTG CAGCGATGTC
CGCTTTTATC ATGTCCTCGA TGAGTCGGGC CAGAAGATGG CCGCCTTCTT TCTGGACCCT
TACTCGCGAC CCGAAAACAA ACGGGCTGGT GCCTGGATGG ATACCTGCCT TTTGAGGCAG
AAGGTTGGCG ATGAACTTCA GCTTCCCGTC GCGTATCTCG TTTGTAATCA AACCCCACCT
GTGGGTGAGC GGCCCGCCCT CATGACCTTT CGCGAAGTGG AAACGCTATT CCACGAGTTT
GGTCACGGTC TCCAGCACAT GCTGACCATC ATCGATCATC CCGATGCCTC GGGAATCAAC
GGCGTCGAAT GGGATGCTGT CGAACTCCCC AGTCAGTTTA TGGAGAACTG GTGCTATCAC
AAGCCGGTGC TGATGGGGAT GACTCGTCAC TACGAGACCG GGGCACCATT GCCAGAAGAT
CTGTTCAACA AGATCGTCGC AGCTCGCACT TATCGCGCCG GGTCGATGAT GCTCAGGCAG
CTTCTCTTTG GTCTGACGGA TCTCGAGTTG CACCACGATT ACGATCCTGC GGGAAGCGAG
TCCCCTTTTG ATGTACAGCG CCGCATCAGC CAGACGTGCG CGGTCATTCC GCTCATCCCG
GAAGATCGCT CGCTGTGCTC ATTCCAGCAT ATTTTTTCGG GCGGCTACGC AGCGGGATAC
TACAGCTACA AGTGGGCCGA AGTTCTCTCA GCCGATGCCT TCAGTGCTTT TGAAGAGGCG
GGTCTTGATG ATGACAAGGC CATTGAACAG GTGGGCCGCC GCTTCCGCAA TACAGTGCTG
TCGATGGGCG GCAGCCGACA TCCGATGGAA GTCTTCCGCG ATTTCCGTGG TCGCGAACCG
AGCCCTGAAG CACTTCTCAG ACACATGGGT CTGACAAAAG TGTGA
 
Protein sequence
MAELLDNPLL VTSGLPDFAR IEASHVVPAV RATVETALKK LDAIESHLQP TWAGIMSPIE 
EMERPFTWSW GPVGHLLGVR NSPELRAAYE EVNPEVVRYS LRVRQSEPIY KALVTLAESP
EWERLSPAQK RIIKDRIKDA ELAGIGLQGD ARKRFGEIEE RLAVLSTQFM NNCLDEIKAF
SLDLTTEEEI AGFTPTLRHL TAQSWNRAHP ESETKATAEH GPWRITLDFP VYGPFMEHAK
RRDLREKLYR AFITLASQGE HNNEPIMREL LSLRKEKAHL LGKNSFAEVS LMRKMAPGVD
AIRHMLHELR DTSWGAAQQD LADLKAFKVS SGDTDDIKPW DVPFWAERLR ESRYSFTDEQ
IRPYFPFERV LEGLFGLIHR LFGVTIEQAQ EPVSVWCSDV RFYHVLDESG QKMAAFFLDP
YSRPENKRAG AWMDTCLLRQ KVGDELQLPV AYLVCNQTPP VGERPALMTF REVETLFHEF
GHGLQHMLTI IDHPDASGIN GVEWDAVELP SQFMENWCYH KPVLMGMTRH YETGAPLPED
LFNKIVAART YRAGSMMLRQ LLFGLTDLEL HHDYDPAGSE SPFDVQRRIS QTCAVIPLIP
EDRSLCSFQH IFSGGYAAGY YSYKWAEVLS ADAFSAFEEA GLDDDKAIEQ VGRRFRNTVL
SMGGSRHPME VFRDFRGREP SPEALLRHMG LTKV