Gene Plim_4094 details

Gene Information

Locus tag: Plim_4094
Symbol:
ID: 9140814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 5249478
End bp: 5252474
Gene Length: 2997 bp
Protein Length: 998 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: protein of unknown function DUF1549
Protein accession: YP_003632104
Protein GI: 296124326
COG category:
COG ID:
TIGRFAM ID:
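
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that Gene Length counts the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def cds_length_from_coords(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(gene_length_bp: int) -> int:
    """Protein length implied by a CDS length that includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = cds_length_from_coords(5249478, 5252474)
print(gene_length)                           # 2997
print(protein_length_from_cds(gene_length))  # 998
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.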


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGAGC TTGCTAATTT TCATGATCCG GCTGTCCTGG AACATCGCCG CTGGCCTGCC 
TTAGGTCATC GCTGCTGTTA TTGGGTCTGC CTGGCGATCA CCGGTTGGTT GGGACTGTGT
CAACCGGGCT TATCTCAGCT TCCCCAGGTC TGGGCGGCAG AGGCAAAGCC TGCAGATAAC
AGCAAGCCTG CTGCACAGAT TCTGCCTGCT GAGGAACTGA AGTCATCAGC AGTCGTCGAA
TCGGCAGTCA TCCCCGGGAT CAATGAGCGA TATGACCTGA AGGTCAAGAA TGTGGATGGG
CCCGATTTCC GCAAGCATGT GGTGCCACTG CTGGGCAAAC TCGGCTGTAA CGGACGTGCC
TGCCATGGAT CGTTCCAGGG TCAGGGAGGC TTCCGCCTGT CGCTGTTTGG TTACGACTTC
AAGGCCGACC ATGAGAATCT CGTGCTGGGT GATAAGCCTC GCGTGAATCT GATGGATCGC
TCGAAGAGCC TGATCCTCAT GAAGCCCACC GAACAGGTTC CTCATGAAGG GGGCGAGGCT
CTCAAGCAGG GGACATGGGA ATATCGAGTT CTCGATAAAT GGATTTCTCA AGGAGCCAAG
CCTGTCACCG ATAAGACACC GGAATTTGTA AGACTGGATG TGTTGCCGGA TGGGAAAAAC
CCGGCAGCAG GCGAACGTCC TGAGATGGTG GGCTCGAAGG TGGGCCAGTC GTGGAAGCTC
AAGTGCATCG CTGTCTGGTC GGATGGAACC CGCGAAGATG TAACACCTTT GTCCCGCTTC
CAGTCGAATA ATGATCAGAT CGCCAAAGTG ACCCAGGAAG GCATGGTGAC GATCACCGGC
CCCGGAGATT CACATGTCGT CGCGTTCTAT GACAATGGCG TGGTGCCTGT CCCTGTGATT
CTGCCTGTGT CGAACTTCAC AGGTGATCAA TATCCTTCCA CTCCCACACC GACTCGTATT
GATGAACTGG TTGTGCAGAA GCTCAAGAAG GTGGGCGTCG TGCAGTCGGA TCTGTGTGGC
GATGAAGAGT TCCTCCGCCG GGTTTCGCTG GATATCACGG GAACTTTGCC GACACCTGAG
GAAGTCGAGA CGTTTGTGCG TTCGAAGGAT GTCGCCAAGC GTTCGAAAAA GATTGATGAA
TTGTTGGAGC GACCAGCTTA TGCCGCCTGG CAGGCGACCA AGATTTGCGA CTACACCGGG
AACAATCCCG AGTATTTGCA GAACGCGGTG GTGGGGAACA ATTCTGCTCA GGCTGCTCGC
GACTGGTATG AGTGGATTCA TGACCGAGTT CAAAGAAACG TCCCTTATGA TGAATTGGCC
GAGAACATCA TTCTGGCGAC GAGTCGCCGG GAGGGGGAAA GCTACGAACA GTATTGCGAA
CGTGTGAGTG GCTACTATGC GAAGGACAGC AAAGGGAGTT TTGCTGAACA GCCTGCGATG
ACGCATTACT GGGCGAGGAG AAACTTCCGC ACGGTCGAAG AGCGGGCTCT CGGATTTGCC
TACACGTTCC TGGGCATTCG AGTGCAATGT GCCCAGTGTC ATAAGCATCC TTTCGATCAA
TGGACAAAGG ACGATTTCGA TCGTTTCAAG AACTTTTTTG CCCGCATTCG TTACGGCGAT
ATCCCGGGTA ACAAAGACGA AAAAGCCGCC ATGCTGGCCA AGCTGGGTGT GGACAAGGAC
CTCAAGGGGA ACATGCTGGA TCGGGCTCTG AAGGACTATC TCGCTTCGGG CAAAGTGGTG
CCGGTTCAGG AAGTCTTTGT GACTCCTCCA GTCAAGCCAA GACCAGTCAA TCCCAAGGCC
AAGCCCAATG CGAAAAGACC GCAGGTTGTG GCTGGCCGAA CAGCCAAGGT GCTGGGTGGC
GATGAGATTG TCATCGAAGA ACTTGATGAT CCGCGAACAG CACTCATGGA TTGGTTGCGT
GCCGAAGAGA ACCCGTACTT TGCCAGAGCT TTTGTGAATC GTGTCTGGGC GGGCTACTTC
CATGTGGGGA TTGTTGAGCC ACCCGATGAT TTGAGTCTGG CGAATCCTCC TTCGAATGAA
GCGTTGCTGG ATGAACTGAC ACGGGAGTTC GTGGCTCATG GCTACGATAT GAAGTGGCTG
CACAGGACGA TTGCCAACAG CCGCACTTAT CAGTTGAGCT GGCAGCCCAA TGAAACCAAC
AAGCTTGACG AGCGGAACTT TGCCCGGGCT GTGCCGCGCC GTCTTCCTGC CGAAGTGGCG
TATGACATTA TCCGTCAGGC GACTGCCAGT GATTTTGAAA TGGCCAAGTG GAATGACCAG
CTCAATTCCC GGGCGATTGA AGATGTGGGG GCCGGTGCGA AGAATGGCCG GGCTCAGGTG
TATGCATTGA ATATCTTCGG GCGATCGATC CGCGAGAGTA ACTGCGATTG CGATCGATCG
ATGGAGCCAA GTCTGCTGCA GACGGTTTAT CTGCAGAACG ATCAGGAGTT GCTGGCTGCC
ATTGAGCGCA AGGGTGGCTG GGTCGACCAG ATTGTCAAAG TGGGCCCTCA TCCTGTGGTG
AGTGACAAAA ATGCCATCAC ACCAGCAGCA GCGGTCGTGC CTGATCTCAC CGACGAAAAG
GGTAAAGGCA AAAAGAAACT TGAGTCGGCA ACCGAAGATT CGTCCAACGA TCGGGATTTG
AAAGAGGTTC TGGCCCGCGT CGACAAACGT ATCGAAAAGG CCCGTCAACA AAAGGACAAG
CAGCAACTGG CCGAACTGCA GCAAACTCGG CAGAAAGTTC TGGAGAAAAT CCAGGCCCAG
GCCAAAACAA CCGCCAAACC TGTCAATCAG GGCTATAAGG CGGGAGTTCT GCCCAGTGAT
CGGATTCTGC AAGAGATTGT CAAAACGGCT TATCTGCGAA CGTTATCGCG ATATCCATCG
AACGAAGAAA CTCGCCGGTC AGTGGCTTAC TTCCATGAGG CGAAAGATGT CCGCGTCGGC
AGCAGAGATC TCTTATGGGC CCTGCTGAAC ACCAAGGAAT TCATGGTCAA TCATTAA
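
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation such as GC content. A minimal sketch follows; the helper names are hypothetical, and only the first block of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, as formatted in the record.
print(clean_sequence("ATGCAAGAGC TTGCTAATTT"))  # ATGCAAGAGCTTGCTAATTT
# GC fraction of the first block only (5 of 10 bases are G or C).
print(gc_fraction("ATGCAAGAGC"))                # 0.5
```

Applying `gc_fraction` to the full cleaned sequence should give a value comparable to the GC content field of the record, depending on how that field was rounded.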
 
Protein sequence
MQELANFHDP AVLEHRRWPA LGHRCCYWVC LAITGWLGLC QPGLSQLPQV WAAEAKPADN 
SKPAAQILPA EELKSSAVVE SAVIPGINER YDLKVKNVDG PDFRKHVVPL LGKLGCNGRA
CHGSFQGQGG FRLSLFGYDF KADHENLVLG DKPRVNLMDR SKSLILMKPT EQVPHEGGEA
LKQGTWEYRV LDKWISQGAK PVTDKTPEFV RLDVLPDGKN PAAGERPEMV GSKVGQSWKL
KCIAVWSDGT REDVTPLSRF QSNNDQIAKV TQEGMVTITG PGDSHVVAFY DNGVVPVPVI
LPVSNFTGDQ YPSTPTPTRI DELVVQKLKK VGVVQSDLCG DEEFLRRVSL DITGTLPTPE
EVETFVRSKD VAKRSKKIDE LLERPAYAAW QATKICDYTG NNPEYLQNAV VGNNSAQAAR
DWYEWIHDRV QRNVPYDELA ENIILATSRR EGESYEQYCE RVSGYYAKDS KGSFAEQPAM
THYWARRNFR TVEERALGFA YTFLGIRVQC AQCHKHPFDQ WTKDDFDRFK NFFARIRYGD
IPGNKDEKAA MLAKLGVDKD LKGNMLDRAL KDYLASGKVV PVQEVFVTPP VKPRPVNPKA
KPNAKRPQVV AGRTAKVLGG DEIVIEELDD PRTALMDWLR AEENPYFARA FVNRVWAGYF
HVGIVEPPDD LSLANPPSNE ALLDELTREF VAHGYDMKWL HRTIANSRTY QLSWQPNETN
KLDERNFARA VPRRLPAEVA YDIIRQATAS DFEMAKWNDQ LNSRAIEDVG AGAKNGRAQV
YALNIFGRSI RESNCDCDRS MEPSLLQTVY LQNDQELLAA IERKGGWVDQ IVKVGPHPVV
SDKNAITPAA AVVPDLTDEK GKGKKKLESA TEDSSNDRDL KEVLARVDKR IEKARQQKDK
QQLAELQQTR QKVLEKIQAQ AKTTAKPVNQ GYKAGVLPSD RILQEIVKTA YLRTLSRYPS
NEETRRSVAY FHEAKDVRVG SRDLLWALLN TKEFMVNH
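
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons one at a time. This is a sketch, not the pipeline that produced the record: it uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs in its set of permitted start codons), and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon.

    Whitespace is ignored; a codon containing anything other than A, C, G, T
    raises KeyError.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGCAAGAGC TTGCTAATTT TCATGATCCG"))  # MQELANFHDP
```

The output matches the first block of the protein sequence listed in the record.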