Gene Plim_4138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4138 
Symbol 
ID9140858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5307151 
End bp5308611 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content56% 
IMG OID 
Productprotein of unknown function DUF1501 
Protein accessionYP_003632148 
Protein GI296124370 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.322699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAAC CACTCCCGAA ATCTGATGGG CCTAACCTCA GTGCGATGGC TCGCCGCCAG 
TGGCTGGCCC AGGCTGGGAA TGGATGCGGA AGTCTGGCCC TGGCTCATCT ACTGACAGGC
TCGACATCAG GTTTATCGTT TGGGGCAGAA CCCACTGCAA CACGTCAGCC ACATTTCGCC
CCCAAAGCCA AACGAGTCAT TTACCTGTTT CAAAGTGGTG GGCCTTCGCA ACTGGAAACC
TTCGACTATA AGCCCCTCCT CAACGATCGC TTTGGAGAAG CCTTGCCAGA TTCTGTCCGA
CAAGGCCAGC GATTGACCGG GATGTCGGCC AATCAATCCT CTTTGATCAT GGTGGGATCG
AAGTTTGGTT TTCAGCAGCA TGGTGCCGCA CAGGCGTGGG TCAGCGATTT ACTGCCACAT
ACGGCCAAAA TCGTCGATGA CCTCTGCATC GTCCGCTCCA TCTACACAGA AGCGATCAAC
CATGATCCAG CCATTACGTT CCTGCAGACC GGTTCTCAGA TTGCCGGTAG ACCCAGTATT
GGTGCCTGGC TCAACTATGG TTTAGGAAGT GATAACGAGA ATCTCCCCGC GTTCGTGGTG
TTGATTACTC GTGGAAAAAC CGATCAGCCG CTCTATTCTC GCTTGTGGGG CTCTGGCTTT
CTTCCATCGC AGCATCAGGG AGTTCAGTTC CGCTCGGGTA AAGATGCGGT TCTTTATCTG
AACAATCCGC CGGGAGTCAG TCGCGAGCAA AGACGTGCTG CTCTCGAAAC ATTGGCACAA
CTGCAGAAAG AGCGTGAAGT CGAGACGCAC GACCCTGAGT TATCTCAGCG GATTGCCCAG
TATGAAATGG CATTTCGTAT GCAGGCCAGC GTGCCAGAAG CCACCGATCT TTCGGGCGAG
ACCGAAGCCA CTTTCGAGAT GTATGGGCCC GACAGCCGCA AGCCCGGCAC ATTTGCCGCC
AATTGTCTCT TGGCAAGACG CCTGGCAGAG CGTGGTGTCA AGTTTATCCA GTTGTATCAC
CAGGGCTGGG ATCAACATGG CGGCTTACCG GGAGGAATTT CGACGCAGTG CCGGGAGACC
GATCAGCCTT CCGCTGCCCT GGTGGCTGAT CTGAAACAGC GTGGTTTGCT GGAGGACACC
CTGGTCGTGT GGGGTGGAGA GTTTGGCCGG ACAAGTTATT CGCAGGGCCG CCCGTCACCG
CAGAACTACG GCCGGGATCA CCATCCCCGG TGTTTCTCGA TGTGGTTTGC GGGTGGTGGG
ATCAAGCCCG CGACGGTGGT CGGCCGGACC TGCGATTTTG GTTACAACAT TGAAGATCGT
CCAGTGCACG TCCACGATCT GCATGCCACC ATGCTGCATC TACTGGGTAT GGATCACGAA
CGCCTGACCT ACCGCTTCCA GGGCCGCGAC TTCCGCCTAA CCGACGTCCA CGGGCACGTG
ATTCCGGAAA TGCTGGCTTG A
 
Protein sequence
MNQPLPKSDG PNLSAMARRQ WLAQAGNGCG SLALAHLLTG STSGLSFGAE PTATRQPHFA 
PKAKRVIYLF QSGGPSQLET FDYKPLLNDR FGEALPDSVR QGQRLTGMSA NQSSLIMVGS
KFGFQQHGAA QAWVSDLLPH TAKIVDDLCI VRSIYTEAIN HDPAITFLQT GSQIAGRPSI
GAWLNYGLGS DNENLPAFVV LITRGKTDQP LYSRLWGSGF LPSQHQGVQF RSGKDAVLYL
NNPPGVSREQ RRAALETLAQ LQKEREVETH DPELSQRIAQ YEMAFRMQAS VPEATDLSGE
TEATFEMYGP DSRKPGTFAA NCLLARRLAE RGVKFIQLYH QGWDQHGGLP GGISTQCRET
DQPSAALVAD LKQRGLLEDT LVVWGGEFGR TSYSQGRPSP QNYGRDHHPR CFSMWFAGGG
IKPATVVGRT CDFGYNIEDR PVHVHDLHAT MLHLLGMDHE RLTYRFQGRD FRLTDVHGHV
IPEMLA