Gene Plim_4237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4237 
Symbol 
ID9140959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5411388 
End bp5413013 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content56% 
IMG OID 
ProductProtein of unknown function DUF1800 
Protein accessionYP_003632243 
Protein GI296124465 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000229991 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCGCA ATCGTGTCCA TGAAGCACAG CAGTCGGGAT GGTCGTCACT AGCCGCCCGG 
CATCTCCTCG CGAGGACGGG ATTTGGCTTC GATCTGAAGC ACGAAGAGAA ACTGGCTGCC
TTATCACGAG AACAGGCAGT CGATCAGCTT CTGGAAGACG CTCAGAAGGC CCCTTGCCCT
ACGCCGCCCG AGTGGGTCAA AACACCGTGG GTGAACACAG AGCGGCGATA TGCAGATACC
ACTGCCGAAG AATTTCGCGG CAAGCATGGT GCGACCAACG GTCGGTATGC CCGTGAGATT
GCTGATCTGC GCCGCTGGTG GGTGAATGAG ATGCTGACGT CTCTGGTACC ACTGCGGGAG
ATGATGACAC TCTTCTGGCA CAGCCACTTT GCCTCGAGCA TTGGCAAGGT TCTCATATCT
CAAGCCATGT ACGAGCAGAA TGCGACTCAA CGAAAGCATG CGTTGGGGAA CTTTCGTCAG
CTGCTGAGAG CCATGACGAT TGATGGCGCC ATGATGATTT ATCTCGATCT GGAGGACAGC
GAAAAAACAC AGCCCAATGA GAATTACGCC CGCGAACTCT TTGAACTCTT TGCTCTCGGG
CACGGTCATT ACACTCAGGC CGATATTCGC GAGGCGGCTC GTGCACTCTC AGGGTGGCAG
CTGAATGCTC CACCTGGCAC AACCTTACCC CAGAGACCAA CCAACCCAGC GGATAATCGA
CGCTTTACGC GCGATGGTAT TGTCGCAGAA CTGGTGGTCG AGCGGCACGA TTCGGGCACT
AAAACAATCT TTGGAAAAAC AGGCACATTC GGTCTCGATG AAGTGATCGA ACTCACAGTC
AATCACCCGG CTTGCGGATT ATTTCTTGCT GAAAAACTGG CCAGCTACTT TGGTCTGAGT
GATGAAACCG GTCGAGTCCA ACAGCAGATG GCCGAGGTGT TCCGATCAAC CGGTGGTGAG
ATCGCACCGA TGCTGCGCGT GCTGTTCACT GCTGACGAGT TTTACACCGA TGGGTCCAGA
AATCGTCTCA TCAAGAGTCC GGTCGTGCTC TGGGTGAGCA CCTGTCGTCA ATTGCAACTC
GACACAACCT ATACGCGAGG AGTCAACAAA TATCTGGCAG CCCTGGGTCA GGAACTCTTT
GAGCCACCCA ATGTGAAAGG CTGGCCAGGT GGTGAAACAT GGATCAGCGC TGGAACTCTG
GCACTGAGAT ACCACCTCAC GGATATTGTT CTGGAATCGA AAGAGCCTCC AGGAATGGAC
CCCATGGGGC GGGATCGCGG TCGGCCGGTC ATGCTTCCCA AAGATCCTGC GGAGCGAGCC
AGATTTCTTG CTCGTATGGG AGGAGGAATG TCTGAGGGTG AAGGGATGAT GGCACCTCGT
CGCGGCGAAC GGGAAACCGG CCCCGCTTAC GAAGTCAAAT TCTCGCCAGA CAAACTCTTT
CCTGCCGGTC TCCCGGACTC GTCTGCCGAT CTGGCTGACC AACTGCTGAA TCGATTGCTG
GTGGATTTAC CACGGGCAGA ACTTCGGGCA GCTGCCATGG AGGCGGCGAC ACGAAACGTG
GGACCTTTGC GGGTTCAGGC AGTCCTGCGG CTGATCCTCA GTTCGCCCGA TTACCAGCTC
ACATGA
 
Protein sequence
MPRNRVHEAQ QSGWSSLAAR HLLARTGFGF DLKHEEKLAA LSREQAVDQL LEDAQKAPCP 
TPPEWVKTPW VNTERRYADT TAEEFRGKHG ATNGRYAREI ADLRRWWVNE MLTSLVPLRE
MMTLFWHSHF ASSIGKVLIS QAMYEQNATQ RKHALGNFRQ LLRAMTIDGA MMIYLDLEDS
EKTQPNENYA RELFELFALG HGHYTQADIR EAARALSGWQ LNAPPGTTLP QRPTNPADNR
RFTRDGIVAE LVVERHDSGT KTIFGKTGTF GLDEVIELTV NHPACGLFLA EKLASYFGLS
DETGRVQQQM AEVFRSTGGE IAPMLRVLFT ADEFYTDGSR NRLIKSPVVL WVSTCRQLQL
DTTYTRGVNK YLAALGQELF EPPNVKGWPG GETWISAGTL ALRYHLTDIV LESKEPPGMD
PMGRDRGRPV MLPKDPAERA RFLARMGGGM SEGEGMMAPR RGERETGPAY EVKFSPDKLF
PAGLPDSSAD LADQLLNRLL VDLPRAELRA AAMEAATRNV GPLRVQAVLR LILSSPDYQL
T