Gene Plim_4084 details

Gene Information

Locus tag: Plim_4084
Symbol:
ID: 9140804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 5239148
End bp: 5240290
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: protein of unknown function DUF1559
Protein accession: YP_003632094
Protein GI: 296124316
COG category:
COG ID:
TIGRFAM ID:
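
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
start_bp = 5239148
end_bp = 5240290
gene_length_bp = 1143
protein_length_aa = 380


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_protein_length(gene_len: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = coding_protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1143 bp) and Protein Length (380 aa).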


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATCGTT CATACCGTCA CTCCGGCCGC AGGGTAGAAA CCTACTCACT CAAAGTGCGC 
CGGTCGTCAA GTGACTATCT ATACAACATT GATCCCGAGA AAGACCTCTT TATGAAAGTT
CCTGATACTC CTCGTCACCG AGGCTTCACA CTCATTGAAC TCCTGGTGGT GATCGCGATC
ATCGCCATTT TGATTGCTCT GCTGCTTCCC GCCGTTCAAC AGGCTCGGGA AGCAGCCCGG
CGAACGCAGT GCCGTAACAA TCTCAAGCAG ATTGGCCTCG CTCTGCACAA CTATGAATCG
ACATTCGGCC GCTTTCCCTG CGGCTGGAAC GGACACAACA ACGTCGCACA AAGCACCACG
ATGCGCTGGA GTTTCCTGGC GTATATCCTC CCTTACGTCG ATCAGGCCAA CACGCTCAAT
CAGTTGGATC TGAACTGGTC CCTCTATCCG CCCGGAGGTG GCCAGCCGCC ACGTGCCATG
CACGTCAATA CGATCATGAC AAAGATTCCG ACCTATCTTT GTCCGAGTGA TCGCTCGGAC
TATGTTTCTA GTCCCACAGG GGTGATTGAC TCTGCTCCTT CCAACTATAT GGCCTGCATG
GGTTCTGGGA TAAACAATGT GGCCGATATC AGTGATGATG GTCAGAGCGA CGACCGTGCC
GATGGTCTAT TCAGTTCCAT CTCCTGGCGA AGGATTGCCG ATTGTACGGA TGGCTTATCG
AACACGGTTC TTTGCTCCGA AAGTCTATTA GGGATTGGTG GTGCCGACCC GGCTTCTACT
GAGAGTCCTG ATGCACAGAC GCATATGGCA TTGGTCAGCC CTCCCACGAG TGTGACAATT
GCCAATTGTG ATCAGGCGAG GCCCGCGAGT ATCGCCCGCT TCGTGGCCAG TCGAAATCGA
GTCTGGGCGG GTCAGGCGTA CGAGAACACC GCTTACAACC ACTACTTCAC ACCGAACAGC
CGGCGCTACG ACTGCTACTT CTGGGTGGCG CAAGGCTTCA AGGCCGCCCG CAGTCGGCAT
ACAGGCGGTG TCCATACCCT CATGGGCGAT GGGGGAGTTC GATTCACCAG TGAAAACATC
GACGCCACGA TCTGGCGGAA CATTGCCACA CGTTCTGGTA GTGAAGTCGT CAGCGAATTC
TAA
 
Protein sequence
MYRSYRHSGR RVETYSLKVR RSSSDYLYNI DPEKDLFMKV PDTPRHRGFT LIELLVVIAI 
IAILIALLLP AVQQAREAAR RTQCRNNLKQ IGLALHNYES TFGRFPCGWN GHNNVAQSTT
MRWSFLAYIL PYVDQANTLN QLDLNWSLYP PGGGQPPRAM HVNTIMTKIP TYLCPSDRSD
YVSSPTGVID SAPSNYMACM GSGINNVADI SDDGQSDDRA DGLFSSISWR RIADCTDGLS
NTVLCSESLL GIGGADPAST ESPDAQTHMA LVSPPTSVTI ANCDQARPAS IARFVASRNR
VWAGQAYENT AYNHYFTPNS RRYDCYFWVA QGFKAARSRH TGGVHTLMGD GGVRFTSENI
DATIWRNIAT RSGSEVVSEF
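
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. This is a minimal sketch, not the pipeline that produced this record: it assumes translation table 11 assigns internal codons the same amino acids as the standard code (the two differ only in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these assignments with the
# standard code and differs only in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three 10-base blocks of the gene sequence, and its final codon.
first_block = translate("ATGTATCGTT CATACCGTCA CTCCGGCCGC")
last_codon = translate("TAA")
print(first_block, last_codon)
```

The first 30 bases translate to MYRSYRHSGR, matching the first block of the protein sequence, and the final TAA is a stop codon, which is why the 1143 bp gene yields 380 residues and not 381.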