Gene Plim_0115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_0115 
Symbol 
ID9136769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp157477 
End bp158766 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content54% 
IMG OID 
ProductStage II sporulation E family protein 
Protein accessionYP_003628167 
Protein GI296120389 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.90623 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGACA TCACTCACGA CCACGCCAGC CAGCATCGCA TTCTTGCCTT GAGCCGCATG 
CTGGAACTGG TACTCAAGCT CAATGATGTC AACGACCTCG CTGCCGTCCT GAACGCGATT
ACCCATGGTG CCTGCCAGGC CGTGGAATGT GATCGAGCCA GTCTTTTTCT CTACGATTCT
CAAGCGGACG AACTCTACTC TCACTCGACG ACCGAATTGG AAATCCGCGA GATTCGCCTT
CCCATTGATA GCGGCATCAT TGGACACGTT GCCCGGGAGC GCGTGCCGCT GGTTGTCGCA
CAACCTCATG ACGATTCGCG ATGGTCACCC GTGTTTGACC AGCAGACGGG CTATCGAACT
CGAAACATTC TGGCTGTGCC CGTGGTCTCC AAATCGGAGA ATCGACTTCT CGGTGTTCTC
CAACTGCTCA ACAAGCATGC AGACTGTTTC GAAGACTTCG ATGTTCAATT GATGCAGGCC
TTTGCCGTGC ATGCGGGAGC CGCCATCGAA AGGCAGTGGC TGCAATCGGC TGCCAGGCAA
GCGGAGGATT ACCGCCGCTC GATGGAAATG GCCCGACAGA TTCAAATGGG GTTTTTGCCG
AAAGAACGCC CACGCTTCCC GCAATACGAA GTGACTGCCT GGTGGGAACC TGCCGAGTAT
GTGAGTGGTG ACTATTACGA CTGGCTCGTG GCTCCCGATG GTCGCCTGGG CATCGCTGTA
GGGGATGTCA GCGGTCATGG CATGGCTGCC AGTCTGATCA TGGCGTCGGT GCGGGCCATG
ATTCATGCGC TGGGAAGATT GGCAACCACA CCAGCCGATA TGATGGAACT TTTGCAGGAT
GTCGTGCAGG CCGATCTCGT CGATGGTCGA TTTCTGACAC TTTTACTTCT GACCATCGAT
CCCGTGACTC ATTGCGTAGA ATTTGCCAAT GCGGGTCATG CACCTGCGAT CTACTATCAA
TCGTCGAAAC GCTCCTGCAG ACGACTCGAA GCCACTCGCA CGCCACTGGG ATTTCCCGTC
AATCAATCCT TGCGATTGCC CATCTATCCC GTCATGCAGC CTGGCGATCT TCTCATTCTC
GGTACGGATG GACTCATTGA AGTTCATGAC GAGAAAGGCA AGATTTTTGG CATGCAACGC
CTCGAAGAAC TCATAGCAAA ACATGCACAT GAGTCCGTTG AAACGTTATC TCAGATTCTG
AAACGTGAAG TTTTCGATTT CTCCACAGAA CATCCCTTTC CAGACGACAT GACGTTGCTC
ATCATTAAGC GCGTCGACGA AGCCGCCTGA
 
Protein sequence
MHDITHDHAS QHRILALSRM LELVLKLNDV NDLAAVLNAI THGACQAVEC DRASLFLYDS 
QADELYSHST TELEIREIRL PIDSGIIGHV ARERVPLVVA QPHDDSRWSP VFDQQTGYRT
RNILAVPVVS KSENRLLGVL QLLNKHADCF EDFDVQLMQA FAVHAGAAIE RQWLQSAARQ
AEDYRRSMEM ARQIQMGFLP KERPRFPQYE VTAWWEPAEY VSGDYYDWLV APDGRLGIAV
GDVSGHGMAA SLIMASVRAM IHALGRLATT PADMMELLQD VVQADLVDGR FLTLLLLTID
PVTHCVEFAN AGHAPAIYYQ SSKRSCRRLE ATRTPLGFPV NQSLRLPIYP VMQPGDLLIL
GTDGLIEVHD EKGKIFGMQR LEELIAKHAH ESVETLSQIL KREVFDFSTE HPFPDDMTLL
IIKRVDEAA