Gene Plim_4236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4236 
Symbol 
ID9140958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5410101 
End bp5411336 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content55% 
IMG OID 
Productprotein of unknown function DUF1501 
Protein accessionYP_003632242 
Protein GI296124464 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00624743 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATCT CTTTTTCCAG TCGACGCGAA TTTCTGAAGC AGGCAGGACT GTTGACTGCC 
TGGAGTCTGA CACTTCCGCA GTTTGTGGTG CAGTCACGCC AGGCGCTGGC TCATGCACCC
ATTGAGGGTT TGCCGGATGA TCGCATCCTC GTGCTGGTTC AACTGGCTGG CGGGAACGAT
GGGCTTAACA CACTCGTCCC CTACGGTGAT GATCTGTATT ACAAGGCACG TCCCAAGCTC
TCTGTGGCGC AGGAAGACGT GCTCAGGATT GACGATTACT GTGGTTTCCA TTCAGAAATG
TACGCTCTGC GGGAGTTGTG GGAAGATGGC CTGCTCAGTC TGATTCAGGG TGTGGGATAC
CCGAATCCTG ACCGCTCGCA CTTTCGATCC ACCGAAATCT GGGAGACGGC TTCGGGATCG
GAGAAGAATA TCGCCAGTGG CTGGATTGGC CGATACTTTG ACAGCGAATG CTCAAAAGCG
GCGACACCAA CACTGGGTGT GCAGCTTGGC GAACGAACGG CACAAACCTT TGCCGGCGAT
CATCCGCGCG TTGTGACTCT CTCGAATCCT CAGCTCTTTC AGTTTTCCGG CGGATCAGCG
CGAGAAGACG AGTTGGCCAA AGTTCATGTT CCCTCAGTGA GTGCGAATTC TTCTTTGGCA
TTTCTGCAGC GAACAGGGAA CGACGTCCTG TCTGTTTCGA GACAGCTTTC CGAAAAGGTG
AGGTTGCAGC CGACAACAAG GGATTACCTG CCCTATCAGT TTTCGCAGAC ACTCAGACTG
GTGGCGAAAA TGATTGCCGC AGAGGTTCCG ACCAGGGTCT ATTACGTATC ACTGCCCGGG
TTTGATCATC ATGCCACACA GAAGATGCGT CATGCGATGC TGTTGCAGGA ACTGAGTGAG
AGCCTCTCAA GCTTTGTGCG CGATTTGAAA AACTTAGGGC ATCTGGATCG CACACTGATT
GTGACCTTTT CTGAGTTTGG CCGCCGTGTG GCCGAGAACC AGAGCGAAGG AACGGATCAT
GGGACGGCGA ACCTCATGTT CATGGCGGGA GGAACTTCCC GAGCAGGGTT CCACGGAACG
CGTTCCGATC TTGCCCGACT GGATGACGTG GGGGACTTAC ACCACACCAC TGATTTCCGC
AGCGTTTATG CCTCGATTCT CAAGGACTGG CTGGGAGCCA ACCCCGCCAG CATCCTCGAT
CCGTCCATTC TGCCTATGGC AGGAATCCTT GGCTGA
 
Protein sequence
MAISFSSRRE FLKQAGLLTA WSLTLPQFVV QSRQALAHAP IEGLPDDRIL VLVQLAGGND 
GLNTLVPYGD DLYYKARPKL SVAQEDVLRI DDYCGFHSEM YALRELWEDG LLSLIQGVGY
PNPDRSHFRS TEIWETASGS EKNIASGWIG RYFDSECSKA ATPTLGVQLG ERTAQTFAGD
HPRVVTLSNP QLFQFSGGSA REDELAKVHV PSVSANSSLA FLQRTGNDVL SVSRQLSEKV
RLQPTTRDYL PYQFSQTLRL VAKMIAAEVP TRVYYVSLPG FDHHATQKMR HAMLLQELSE
SLSSFVRDLK NLGHLDRTLI VTFSEFGRRV AENQSEGTDH GTANLMFMAG GTSRAGFHGT
RSDLARLDDV GDLHHTTDFR SVYASILKDW LGANPASILD PSILPMAGIL G