Gene Plim_3921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3921 
Symbol 
ID9140639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5035286 
End bp5038579 
Gene Length3294 bp 
Protein Length1097 aa 
Translation table11 
GC content56% 
IMG OID 
Productprotein of unknown function DUF1549 
Protein accessionYP_003631931 
Protein GI296124153 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCAGC CTCTTCCCAC CACCTGCCAA TTCGCCTGGT GTTTCAAGTC AATTGTGATC 
TTCGGCGTGT CGCTCGCATG GCTCTATGGC AGCCCGGCGA CAAGCGCTGG TGAGGCACCC
ACTTTTCCGC AAGAGCAGAT CGAGTTTTTC GAAAAGAAGA TCCGCCCCAT CCTGGTCGAG
AACTGCCACA GTTGCCATGG AGCCAGTCAG CAGAAAGCTG GCTTGAGGCT CGATTCTCGC
GACGCGATCT TGCGAGGCGG TGAATCGGGG GCTTCCGCAG TCGCTCAAAA GCCCACGGAA
AGCCTGCTCA TCGAGGCCAT TCAATACGAA GCCGATGGTT ATCAGATGCC TCCCAAAGGG
AAGCTTCCCG CAGAAGCCAT TGCCGATCTG ACTCGCTGGG TGGAGCTGGG AATGCCGTGG
CCCGCTGGCG ATGTTCCTGC CGAAAAAGGC ACGGCAGCCG AGTTCAATTT TGAAGAGCGA
GCGAAGCACT GGTCGTTCCA ACCAATCACC AGACCGCAAG TCCCTCAGGT TCAAGAGGCT
GGCTGGGCTC ACAATCCCAT CGATCAGTTT GTGCTCTCCA AGCTGGAATC GGCCCAGTTG
TCACATGCTC CTCAGGCCGC ACCGCTGGCT CGTTTGAGGC GTTTGGCCAT CGATCTCACG
GGGCTGCCAC CCACTGCCCA GGAAATCGCC GAGTTCCAGG CCGACGTTCG CCCGGATGCC
TGGCAGCATT GGGTGAATCA TTATCTGAAT TCACCTCACT ACGGCGAACG CTTTGCCCGG
CACTGGATGG ATCTGACACG TTATGCCGAG ACGCACGGCC ACGAGTTCGA CTACGAAATC
CCCTATGCCT GGCCTTATCG TGATTATCTG ATTCGAGCCT TCAATACCGA TGTCCCTTAC
AACCAGTTTG TCGCCGAACA TGTTGCGGGC GACCTTCTGC CGCAACCACG ACTTGATCCC
GCCACAGGTT TGAATGAATC CATCTGCGGA ACGGCCTTCT GGTGGTTGTC TCAAGGCAAG
CATTCGCCGG TCGATATCCG TTCGGAAGAG TGCGATACGG TCGACAATCA ACTCGATGTC
TTCAGCAAGA CATTTCTGGG AATGACGGTG GCCTGTGCAC GCTGCCATGA TCACAAGTTT
GACCCTATCC GGATTCGCGA TTACTACGCA CTCGCGGGGT ATCTGCAAAG CAGCCGGCGG
GATGTCGCGA ATCGAGTTCC GGTCAGCACT TATGAGGCGA TGATCACTGA TGCCGCCCGG
CATCACGATG GACGATGGAA AGCGATCCTG GAGATTCAGC CGGAATTGTT CAATCAGCCG
GCCAGGTGGT TGAGCGCGAT GACGACTCGC CATCAGAAGC GGATAATGGC CCATCCGACG
GACTGGCTCT CGGCCTGGCT GCAGCTGGGA TTTCTTACGA ACAAAGACGA ATTTGCCGCA
AAAAAACAGA GCATGGCCCG TACCCTCAAA ATTCAACAAG AGCAGGCGGA GGCGGCTCGT
AAAGCTGCCA CAATTCTGGC TGATTTTGAG CCGGGAACTT ACAACGACTG GATCACAACC
GGAGCAGCGT TTTCGTTGAA TGAAAACTCA CTCCTGATTG ATGACGGGCG GCCCGGTGAA
GGGGAAGTTT CGCTTCAGGA GGCAGGGACG GCTCACTCCG GTGCAGTTTC TCGAAAGCTG
ATTGGGTCAC TCAGTTCACC CACTTTTGAA CTTAAGCATC GTTATCTGGA TCTTCGAGTC
GCCCGGCTGG GCGGGCCGCC ACAACCCGGC AGGCAGATCA AAAATGGGCA AGTCCATGTC
ATTATGGATG GCTTTCAACT CATTAAAGAC CCGCTCTATG GATCGTTCAC CTTGAATGTT
CCCAACGACG GTCAATGGCG GTGGTTTCGG CTGGATCTCT CCCGGGCGAT GGGGAGCAAG
ATCTATCTCG AAATTGTCGA TGAAGCTGCC GATGGCTGGA TCGCTGTTGA TGAAGTCCGA
GTGACTGATG GCCCGCGAGC TGTTGACGAT GTACCGGCCA TGATGTTGGA TTGGCTCAAT
GATCCAGCGA TTGTCGAACC CGAGCAACTG GCGGCTAAAT ATGCCGCATG GTGGCAAAGT
GGTGGTGGTC AAGAGAAAGA CGAATTTCCT TCGATCCCAT TCGCGCAGGC TTTGCGTGAT
CTCTGGCAAG CGGGCGTCCA GAATGAGCAA CTCTTTCCAG AAGGATCTGC CTGGAAATTG
AAGCGGCAGC AAGTCGAGCA GCAGCAGGCC CGCTGGCAGG CGGCGATGAG CCAACTTCCC
GAGCCCGAGT TTGTTCTCGC TATGGCCGAT GGTACCGCAG AAGATGATCA TGTCCTGCTG
CGGGGAAATC ACAAGAAGCC CGGCCCTGTC GAGCCTCGTC GACCACTGGA AGTTCTGGGT
GGACTGAAGA TCAAGGCACC TGATCACGGG AGTGGGCGGC TCGAACTCGT TGAGAATCTG
GTGAGCCCTG ACAACCCGCT GGTGGCGCGA GTCATCGTGA ATCGACTGTG GCATTACCAC
TTCGGGCGAG GATTGGTTCC GACGCCGGAT GACTTCGGCA AGATGGGGCA ACCCCCTTCG
CATCCTGAAC TGCTCGATTG GCTCGCCAGT GAACTGATTC AGTCGGGCTG GTCACTGAAG
CATATTCACC GCCTGATTCT GAATTCGGCG ACATGGCAGC AATCGAGCGA TCTGGCCATC
GGTGAAGTCG AGGCGAAAGA TCCACAGAAC ATTCTGCTGC ATCGGATGAA TCCCCGGCGA
CTCGAAGCGG AAGCTGTGCG GGATTCGATT CTGGCTTTTT CAGGCCGCCT CAACAGGACG
ATGTATGGCC CACCCATCCC GCTTCATTTG ACCCCATTTA TGGAAGGTCG GGGGCGTCCA
GGGCAATCTG GCCCACTCGA TGGCGATGGT CGGCGAAGTT TGTATTTGAG TGTGCGGCGA
AACTTTTTGA ACCCGGTCTT TCTGGCTTTC GATTTCCCGA CGCCATTCAC CACCATGGGA
AGACGCTCCA CCAGTAATGT TCCTGCTCAG GCACTCGTGC TGCTCAATAA TCCTTTGATT
CTTGCTGAAG CCGATCGAGC GGCCCAGCAG ACGAAAACAC TTGAGCCCGC CTCAAGGGTC
GAAGCCTTGT GGCTCGCCGC CTATGGTCGC CCGCCAACCT CAGCCGAATC CCGCGAGGCA
ATCGAATTCG TCGATGAACA GACGAAAGAG TACGGCGGCA ACGACCCATC CCCCGCCTGG
CGCGACCTCG CCCATGTGCT GCTCAACAGT AAAGAGTTTT CGTTTGTGCC GTAG
 
Protein sequence
MPQPLPTTCQ FAWCFKSIVI FGVSLAWLYG SPATSAGEAP TFPQEQIEFF EKKIRPILVE 
NCHSCHGASQ QKAGLRLDSR DAILRGGESG ASAVAQKPTE SLLIEAIQYE ADGYQMPPKG
KLPAEAIADL TRWVELGMPW PAGDVPAEKG TAAEFNFEER AKHWSFQPIT RPQVPQVQEA
GWAHNPIDQF VLSKLESAQL SHAPQAAPLA RLRRLAIDLT GLPPTAQEIA EFQADVRPDA
WQHWVNHYLN SPHYGERFAR HWMDLTRYAE THGHEFDYEI PYAWPYRDYL IRAFNTDVPY
NQFVAEHVAG DLLPQPRLDP ATGLNESICG TAFWWLSQGK HSPVDIRSEE CDTVDNQLDV
FSKTFLGMTV ACARCHDHKF DPIRIRDYYA LAGYLQSSRR DVANRVPVST YEAMITDAAR
HHDGRWKAIL EIQPELFNQP ARWLSAMTTR HQKRIMAHPT DWLSAWLQLG FLTNKDEFAA
KKQSMARTLK IQQEQAEAAR KAATILADFE PGTYNDWITT GAAFSLNENS LLIDDGRPGE
GEVSLQEAGT AHSGAVSRKL IGSLSSPTFE LKHRYLDLRV ARLGGPPQPG RQIKNGQVHV
IMDGFQLIKD PLYGSFTLNV PNDGQWRWFR LDLSRAMGSK IYLEIVDEAA DGWIAVDEVR
VTDGPRAVDD VPAMMLDWLN DPAIVEPEQL AAKYAAWWQS GGGQEKDEFP SIPFAQALRD
LWQAGVQNEQ LFPEGSAWKL KRQQVEQQQA RWQAAMSQLP EPEFVLAMAD GTAEDDHVLL
RGNHKKPGPV EPRRPLEVLG GLKIKAPDHG SGRLELVENL VSPDNPLVAR VIVNRLWHYH
FGRGLVPTPD DFGKMGQPPS HPELLDWLAS ELIQSGWSLK HIHRLILNSA TWQQSSDLAI
GEVEAKDPQN ILLHRMNPRR LEAEAVRDSI LAFSGRLNRT MYGPPIPLHL TPFMEGRGRP
GQSGPLDGDG RRSLYLSVRR NFLNPVFLAF DFPTPFTTMG RRSTSNVPAQ ALVLLNNPLI
LAEADRAAQQ TKTLEPASRV EALWLAAYGR PPTSAESREA IEFVDEQTKE YGGNDPSPAW
RDLAHVLLNS KEFSFVP