Gene Plim_3158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3158 
Symbol 
ID9139872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4086227 
End bp4087558 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content55% 
IMG OID 
ProductStage II sporulation E family protein 
Protein accessionYP_003631172 
Protein GI296123394 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAGCT TGCGGGAAAG GTGTGTTGTG GCGACTCCTT TGAATCAGAT TTTTCCTGGT 
GCTCCTCAGG AGTGGCAGGA ACGTTTGCGC ACGATCGTCG ACATGATGCG CGAGATGAGT
CTGCAACACG ATCCTCAGGC CATGGTTCGC GCCTACGGTC GCCGGATGGT TGAACTCATG
CCCAGCATGC GGCGTATTTC TCTTTCGAGA CGAGATCTGC CGCCAGGCCA GGTACGTGTC
ACTCGCTTCA GTGGCTGGCA GCATGAAGTC AATCCGTGGA AGGAGCCCGA AAAGCTGCCA
CATTTGAAAG GCGGTCTCCT CGCAGAACTC ATCTGGGCCG ATGAGCCACA AATTCTCAAT
GATTGTCATA TCTCGCAAGA TGACCCCTCT TATGAATTCC TGAGTGGCCA GCGCAGCATC
ATGGCGATTC CCCTGTTCGA TAAAGGAACG GCTCAGAATA TGGTGGTGCT CTCGAATGGC
GAGCCCCATG GCTTCTCGCC CGATGAACTT CCCGAACGTG TCTGGATGGC CAATCTCTTC
GGGCAGGCGA CACAAAATCT GGTCTTGGCC GAGCAACTTG AGGCGGCCTA TCAGCAGGTC
GAACGCGAAC TTCAGGTGGT CGCCGACATT CAAAGATCCA TGCTCCCCAA GGAACTTCCA
CGCAGGGCTG GCCTCGAAGT CTCGGCTCAC TACCAGACAT CCCGCCAGGC CGGTGGCGAC
TATTACGATT TCTTCAATTA TCCCGATGAC TCAATGGGCA TCCTTGTGGC CGATGTCAGT
GGTCACGGCA CACCTTCGGC TGTGATGATG GCTTTAACTC ATTGCATTGC CCACATGCTT
GCCGGCCCAC CGACAGAGCC CAGTGAACTC CTGGCTCATC TCAATGACCA CCTGACGAGC
CGGTACACAC TCGAATCCAA TCACTTTATC ACAGCTTTTT TCGGCATCTA TCATCCGCGC
ACCAGGTGCA TTGAGTATTC ATCGGCAGGG CACAATCCGC CCCGGCTCTG GCGCGCGCGA
ACTGGCGAAA TCATCTCTCT CGAAAATGCC ACCTCGCTAC CCTTGGGAAT TGCTCCCGAT
TTAACATGGC CGAACGCCAG TTTAGAACTG CAGGAAGGAG ATCGACTCGT CATCTATACC
GACGGTATTG TCGAAGCCGC TGATCCGCAA TTTGAACTCT TTGGCATGGA GCGACTGGAC
GACCTCATTC GTCAGGTGCA GGGGACACCC GAAGAACTTC GCAATGCCAT CCTGACTGCG
GTCGATGTGT TCACGCATCA CGCTCCGGCT TCTGATGACC GCACGCTACT CGTTATTGAT
CAATTGACGT AG
 
Protein sequence
MASLRERCVV ATPLNQIFPG APQEWQERLR TIVDMMREMS LQHDPQAMVR AYGRRMVELM 
PSMRRISLSR RDLPPGQVRV TRFSGWQHEV NPWKEPEKLP HLKGGLLAEL IWADEPQILN
DCHISQDDPS YEFLSGQRSI MAIPLFDKGT AQNMVVLSNG EPHGFSPDEL PERVWMANLF
GQATQNLVLA EQLEAAYQQV ERELQVVADI QRSMLPKELP RRAGLEVSAH YQTSRQAGGD
YYDFFNYPDD SMGILVADVS GHGTPSAVMM ALTHCIAHML AGPPTEPSEL LAHLNDHLTS
RYTLESNHFI TAFFGIYHPR TRCIEYSSAG HNPPRLWRAR TGEIISLENA TSLPLGIAPD
LTWPNASLEL QEGDRLVIYT DGIVEAADPQ FELFGMERLD DLIRQVQGTP EELRNAILTA
VDVFTHHAPA SDDRTLLVID QLT