Gene Plim_4087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4087 
Symbol 
ID9140807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5242682 
End bp5244043 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content55% 
IMG OID 
Productprotein of unknown function DUF21 
Protein accessionYP_003632097 
Protein GI296124319 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAATGGG GACTTTTCGG CGAGCTGGCT CTCATCCTCT TGCTGATTCT GTTCAACGGA 
TTCTTCGCCG GTGCGGAGAT CGCGATTCTG ACTGCCAAAC GCGGGCGGCT GGAACAGCTT
TCACAGGAAG GCGATCGTGG TGCCAAAGCG GCTCTCAAGC TCTCCAGTGA CGCCGATCGG
TTTTTACCCA CTGTTCAAGT TGGGATCACA CTCGTCGGTA CGTTTGCCGC CGCCTTTGGT
GGTGCCAGCT TCATCAGCGA GGTCTCGCAT CTCATTGGAC AAATCCCTGT TTCCTGGATT
CAGCAGCGTA GTGAAACGAT TTCTCTGGGT GTCATTTCGG TCGGGATTGC CTTCTTTTCG
CTGATTCTGG GCGAACTCGT TCCCAAGCGA GTCGCGTTGC AGAATGCTGA GTTCATGGCC
CGCTGGGTGG CCTTGCCCAT GGTACTTCTC CAGACCATTG CCCAGCCATT TGTCTGGTTC
CTGCGCGTCT GTACCAAATC CGTGTTGCTC ATTCTGGGCC AGAAAACCGA GATCCGCGAC
AGTGTCTCGG TCGAAGACAT TCAGCACCTG ATTGATGCCG GTCATGAAGC CGGAATTCTG
CACGAGGCCG AACAGCAGAT GGCCCAGCAG GCTTTAAAAA TGCGCGAGCG GACAGCCGCC
GAAATTCTCA GGCCACGAAT CGATATTGAT GCGATCGACG TCGATACACC CCCCGAAGAA
GTCCTGGGAG CCATGGCCAT GTCGGGCTTC TCCCGGGTAC CTGTTTGCGA AGGGAGCATC
GATCGGATTG TCGGCTTCAT TTACATCAAA GACGTCTTTC TCGAAAACTA TCTGGGCAGG
TCTCTCGATA TCCGCCGGGT GATGCGCGCC CCGCTCTTTA TTCCCAAAAC GCTGACCATC
TCGAAACTGC TCGAACTCTT CCAGAAAGAG CGGACTCAAC TCGCGATCGT GCTCGACGAA
TATGGTGGTA CCGAAGGGAT GGTCACCCTC GAAGATGTCA TGGAAATCCT CGTCGGCTCA
ATTCATGACG AGCATCGCCG CGATGACGAG CAACTGATTG TCCGACGTGC TGATGGCAGT
CTGCTGGCAG ATGCTGCTCT GAACCTGCAT GAACTGCAGG AAGCGTTGGA ATTCAGCAAA
TGGCCTGAAC CTCCTCCTCG AGGCATTGCC ACCATCTCGG GACTGGTCGT CGCCCTGCTT
AAGCGACCTC CCAAAATTGG AGATATCATC CAATGGAGCC AACTCCGCGT CGAAGTGGTC
GATATGGATG GCCCGCGGAT CGACCGGCTG CTTGTGAGTC GCATCGTTCC TGAATCCTCG
AACGAAGCCG AGGCCAAACC ACAGGAAGAA ACGCAAAGCT AG
 
Protein sequence
MEWGLFGELA LILLLILFNG FFAGAEIAIL TAKRGRLEQL SQEGDRGAKA ALKLSSDADR 
FLPTVQVGIT LVGTFAAAFG GASFISEVSH LIGQIPVSWI QQRSETISLG VISVGIAFFS
LILGELVPKR VALQNAEFMA RWVALPMVLL QTIAQPFVWF LRVCTKSVLL ILGQKTEIRD
SVSVEDIQHL IDAGHEAGIL HEAEQQMAQQ ALKMRERTAA EILRPRIDID AIDVDTPPEE
VLGAMAMSGF SRVPVCEGSI DRIVGFIYIK DVFLENYLGR SLDIRRVMRA PLFIPKTLTI
SKLLELFQKE RTQLAIVLDE YGGTEGMVTL EDVMEILVGS IHDEHRRDDE QLIVRRADGS
LLADAALNLH ELQEALEFSK WPEPPPRGIA TISGLVVALL KRPPKIGDII QWSQLRVEVV
DMDGPRIDRL LVSRIVPESS NEAEAKPQEE TQS