Gene Plim_3167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3167 
Symbol 
ID9139881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4096903 
End bp4100043 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content55% 
IMG OID 
Productprotein of unknown function DUF1080 
Protein accessionYP_003631181 
Protein GI296123403 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.213021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTTCC ATGTGAAATC AGTGGTTGAG ACAGGTGGGC AGAGTCAGGT CTATTCGGAC 
AATCAAGTTC CGGCCTGCCT CCGCTTTGGG TCAAATGTCA TATCAACCTT TCTCGATTTG
TACCGCAAAA GGTGTTTCAT GAGTTTCTGG ATCAGACCTC GCGGTTCGCA CCATCCAGGC
CCTGTCCGCC ACAAGCCCGG CATGGGCATT CACTGGATGA TCCTGTGCTC TTTGACACTC
TCGATCATTT CGGGATGTGG GCAAAGCCCA TCCAGCAGCC AGACTCCCGA GCAGGTGACA
TCGAATGCAA GCTCTGCTTC CACTGAACCC ACTGCGAATG CCACCACAGC CGCACAGCCT
CAGGCAGTTG CCAACTGGCT CGAAAATCCT CAGGCCATCG ACTTGTCCGG GCGGTATGGG
CGTCGATTGC AGATCGTCTG GAAAGACGCC ACTCGCGATC CCATGGAGGG CGATGACGAC
GACGACTCGA TGGCAGATCG CGATGTGACG TCCCCAATTG ATCCAGCAGA CGCGAACGAT
CCCCTGATGG CCGAGAATTT GCCAGCGGGC TCGTCACAGG TCACCACAGG TAAAGACAAC
GCTTTTGGTG CTGCCAACGA AGAGAAACCT GACGATGCCA GCGTGAGCCA ATCGTCTCAG
GCCGATTGGT ATGTGCTGAT TGACGGTGAG CCCGTGCGAG ACGAGACCGG GATCCTGTTG
TCGATCCCTT GTGAAGTCCG TGTCCCTCCT GGTCCTCATG AAGTGACTTT GGCTCAGCCG
GGAATGGTTG ATCTATCGAA AAAAGTGGAG ATCCGGCAGG ATCGCAAAGT CAGTTTCGAA
CGACCCGAGA AGCCGGTGCG TGGAGCCTCT TCGCTTTCGG GGCCACTCTT TGACCTGGCA
AGAGGCGAGT TCCTTCCACT CGATGAACTC AACACTGCCG GGAAAGAGTA CGACCCCTGG
CTTTCTGCTG ATGGACTCAC GCTGGTCTTT GCCGGTGATC GAGCGGAAGG CCGAGGCGTT
TACCTGGCGA CACGTCCCAC AAGATACCAT GCCTTCTCGC CTGCCGAGTT AATTGAAATT
ACACGGTCGG GCGAATTCGT GGCCACGCCC GTCCTCTCAC CGGATGGGTT GAGTTTGATC
TATGTGCACC CCGCCCGCAC GCGGATCTGG CTGCTGGAAC GCAATTCCGT CGATGATCCC
TTTGTCAAAC GGACAGCTCT TAGGAGCATC GATCAACCGG GGTTCGAATG GATCGGGGCT
CAACTGATGT TCCCAGAGGC CATTTCACAG ACACCGCAAT TCAAAGCTTC GGCAAACTTC
CAGTTAGCTT GGATCGAACG TAACGCCGCA GGTCAACAGC AGGTTTTTGT CGCCGAAGGG
CCGAGTCTGG GGAGTTTGTC CAAAGCCCAG GGCAAAGCGA TCATTTTACC CGGAGACCGG
CCCTGGTTTA CCGCTTCGAC GGATCGTCAA TTCAGTCTCG ATGGTCATGT TCTGCAGAGG
TGGATTCGTG AACAATCTCT TGGGCCGGTC GCCTCGGCAC TCTATGGAAA TCCATCGCCC
ATTGGAGAAT TTCCCGCTGG ATTTCCTCCT CCACTCGCCA ACGAACGATC GCTGTTTATT
ACCGATGATG AGCAGTGGGC TGTCGCTGCC GTCACATCCG CAGCCAGCGA AGCCACACTA
CAACAACAGC CGGGCGATCT CATGCTGCTT CGTCTCTCTG ATGGCCCGCA ATGGGGTTGG
AAATTCCAGG GGCGCAGTCT GAAGCCAATC CCCTCCACCG AGACCACACC GCCGACATTA
GTCGCTGATC AGTCAGCCTC GAAAGTCATG AATGCAGAAG ACCGTTCATC ACCAGCCACG
ACGAATCCAT CAACTCCCGT TCGTGAGGAG ATGGATCCCT CCGTGAAGAC GAACGCCGAT
CCCGCAGCGA CCGTCGCGAA CCAGCCTGTG CAACCAGTGG AAGTTCAAAC GGCCTATTCG
ACCTATGAGA AATCTCTGGC TGAGTTTCGC AAGGCTCTGG AAGCTCGAAA TTATGAACAG
GCGGCACAAA TCCTCCAGCA GCGGCGAGAA ACTTCGTTTG CGACCGCATT GAATCCACTG
ACAGAACTCG ACCTGGCCTG GCTGAAAACA CTCAATGAGT TTCAGGTCAT GGTGAACGAT
GGTGTTCGCC AGCTGGAGCC CGGCACCACA GTACGCGTCG GTTCTGCCAA GCTGGAGTTG
ATTGGTCTCA AAGAGGGTGT TCTGAGTTTG AAGTCGCGAC TGAAGACCAT TGAAAAACCT
TTGTGGGAAA TGTCGACGGG CGATCTGCTG GCACTGGCCG AATCTCTGCC GGGTGGGACG
AATCAGGCAT CCGCGCTGAA GACGCTCGCT TTCGTAAAGG CAGATCCTGT TCTTCCAGCA
CGAGTCATCG AATTGTGGCT GGGCCGTGCG GGCCCCGGCG GACAGGATTT TCTCGAAGCG
TTCAGCACTC GCGAACTTGA AGAAGGCCGC CTGGCTCTGG CCGAAAATCG GTTAAGCAGT
GCGATCGAGC ATTTTGGCAA GACGATTGCC GCCGGGCCGG AAAGACCCGC CGCACAGGCA
GCCGAAAAGG AAAAAGCCCA GCTCTATGAT CGCACTCGCT GGAAGATTGT CGGCAAGCGT
GACTGGGCTC GCGGCCCTGA CGGTGAATGG AGTGCTGATG CCCGGCGGAT CGACGGGGCT
TATCTCGTTT CCGAAAGTGA TTACGAAAAC TTCGTATGCG AGTTTGAATG GAAAGCCGAT
CAACCCGGTG CACAGGGTGG GCTTTACTTT CATTATGCCG GTGAAGGAAA CCCCTTTGAA
TTTGGCTATA AAATCCATCT TGCTGGCGAC ATGGATCAGC AGGGAATGGA TCAATATTCA
ACGGGTGCCC TCTTTGGATC GGATGCACCC AAAAAGAAGG TCGCCAAAAA GAATGCTTGG
AACCGCTTCC GTTTGACAGT CGTCGGCCCT AAGACGACTG TCCAGATCAA CGATGAAGTC
GTGCTCGAAA CCGATGTGCC TGTTTCCAAA AGCGAACCTC GTGGTTATCT GGCAATCGAC
GGAGTGGGTG GCGCCTTCCG CTATCGCAAG ATTCTGGTTT ATGAACCGAG CAACTCTCCA
GCTGCCAAGC CTCAAAACTA G
 
Protein sequence
MIFHVKSVVE TGGQSQVYSD NQVPACLRFG SNVISTFLDL YRKRCFMSFW IRPRGSHHPG 
PVRHKPGMGI HWMILCSLTL SIISGCGQSP SSSQTPEQVT SNASSASTEP TANATTAAQP
QAVANWLENP QAIDLSGRYG RRLQIVWKDA TRDPMEGDDD DDSMADRDVT SPIDPADAND
PLMAENLPAG SSQVTTGKDN AFGAANEEKP DDASVSQSSQ ADWYVLIDGE PVRDETGILL
SIPCEVRVPP GPHEVTLAQP GMVDLSKKVE IRQDRKVSFE RPEKPVRGAS SLSGPLFDLA
RGEFLPLDEL NTAGKEYDPW LSADGLTLVF AGDRAEGRGV YLATRPTRYH AFSPAELIEI
TRSGEFVATP VLSPDGLSLI YVHPARTRIW LLERNSVDDP FVKRTALRSI DQPGFEWIGA
QLMFPEAISQ TPQFKASANF QLAWIERNAA GQQQVFVAEG PSLGSLSKAQ GKAIILPGDR
PWFTASTDRQ FSLDGHVLQR WIREQSLGPV ASALYGNPSP IGEFPAGFPP PLANERSLFI
TDDEQWAVAA VTSAASEATL QQQPGDLMLL RLSDGPQWGW KFQGRSLKPI PSTETTPPTL
VADQSASKVM NAEDRSSPAT TNPSTPVREE MDPSVKTNAD PAATVANQPV QPVEVQTAYS
TYEKSLAEFR KALEARNYEQ AAQILQQRRE TSFATALNPL TELDLAWLKT LNEFQVMVND
GVRQLEPGTT VRVGSAKLEL IGLKEGVLSL KSRLKTIEKP LWEMSTGDLL ALAESLPGGT
NQASALKTLA FVKADPVLPA RVIELWLGRA GPGGQDFLEA FSTRELEEGR LALAENRLSS
AIEHFGKTIA AGPERPAAQA AEKEKAQLYD RTRWKIVGKR DWARGPDGEW SADARRIDGA
YLVSESDYEN FVCEFEWKAD QPGAQGGLYF HYAGEGNPFE FGYKIHLAGD MDQQGMDQYS
TGALFGSDAP KKKVAKKNAW NRFRLTVVGP KTTVQINDEV VLETDVPVSK SEPRGYLAID
GVGGAFRYRK ILVYEPSNSP AAKPQN