Gene Plim_1436 details

Gene Information

Locus tag: Plim_1436
Symbol:
ID: 9138131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 1840833
End bp: 1844009
Gene Length: 3177 bp
Protein Length: 1058 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: heme-binding protein
Protein accession: YP_003629469
Protein GI: 296121691
COG category:
COG ID:
TIGRFAM ID:
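
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions that happen to agree with the listed numbers, not statements taken from the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1840833
END_BP = 1844009
GENE_LENGTH_BP = 3177
PROTEIN_LENGTH_AA = 1058


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_length_aa(gene_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the four fields are mutually consistent: the interval spans 3177 bp, which is 1059 codons, one of them a stop.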


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCGCA TAGAGTTTGA CGTTCTTCTG GCACCGTGCC TGACAGTTCT TCTGATGATT 
CAACCGGCCT GGGCTGAATC CGAACGCTAT TCCATCACAG TGCCCGAAGG CTTTGACATT
CGACAGGCGG CCAGTTTTCC GCTCGTCGAA CGCCCAATGT TTGCAGCACT TGATCAGTCA
GGACATCTCT ATGTTCTTGA TTCGGGTGGA TCGAATGGTG GAGATCGATT GAAAAACCCC
ACCGATGTCA TTCGTCGCCT GACGGATACC AATGGCGACG GTATCTACGA TCAAAGCACA
ATTTTTGCTG ATAAAATCGT GTTTGGCACA GGGATTGCCT GCCACGATGG AGCCGTATTC
ATCACATCTC CACCCAGCCT CTGGAGGTTT GAAGATACCA CTGGCGACGG AATTGCCGAT
CAGCGTGTAG AGCTCGTGAC TGGATTTGCT TTCAATCAAA GTTGTACTGA TGATCTGCAT
GGCACGACTG TGGGGCCAGA TGGTCGAATC TATTTTTTAC CTGGTCGATT TCACCATAAA
GTCCGCCTGA AGGACGGTAC GCCTCTACGA GATGGTGTTG GCCCGTGGCT GATGCGCTGC
CGACCGGATG GAAGTGATGT CGAGTTTGTC TCTGGTGCAG TAGGAAATCC CGTCGAAGTG
GCTTTTCTCC CCAATGGTGA TTCATTTATT CAGGGGACAT TCTGGGCGAA ATCCTCAGCG
CCCGGTGGAC TGAGGGATGG CCTGATTCAC GCTGTCGCAG GCGGTGAGTA CTCGGTTCGA
GATCGCGACT ATTCCGACCG AATACGAACC GGGGATTATC TTCCCGCACT TGTCCCTCTA
ACGGCGACAG CCCCCAGTGG CTTGACGAGT TATCGGAGCT CCTCCTGGGG AGACGAGTTT
CAAGAGAACC TGTTTTCTTC ACACTTTAAT ACAGGCAAGA TTCTGCGTCA TCGACTCAAG
GCTGAAAGTG CAACATATCG TTGCGAAACG GAAGAGTTTA TCACAGCACC ACAAGGAACC
GTTCATTTCA CTGATGTTCT GGAAGATGCT GATGGAAGTC TGTTGATTGT CGATACGGGA
GGTTGGTTTA TTGCCTGTTG CCCTGCCTCC GGTTCGAGCC AGCCAACAGT CAAAGGATCA
ATTTTTCGAA TCCTTCGCAA TGCCGCGACG AAGGTTCAGG ATCCCTATGG CAATCTCATT
CCATGGAAGT CTTTACCTAC TGACGATCTG TTGGCTCGGC TTGATGATTC GCGGGTCATG
GTGCAGGAGC GTGCCATCAT CGAAGTGGCT CGACGAGAAC AACGGATGAC CGACGCCTTG
GCAACTCTTC TGACATCGCC ACAATCCAGT TTCAGGCAGC GAACAGGAGC GGTTTGGGCG
CTTTGCCGCA TGGATGACAA TGCGGCACGG GCAGCGACTC GGCTGGCCTT CAGAGATCCC
AGCCACAGTG TACGACAAGC AGCTGCCTAC TCGGCAGGTC TTCACCGTGA TCGATCGGCA
CGTCAAACGC TGGAAGCGCT ACTCGTTGAT GAATCATCAG GAGTTCGACG AGAAGCTGCC
AATGCCCTGG GCAGACTTCA ACAGAAGGAA TCCATCCCGG CTCTTCTCAA AAGCCTAGAA
CCACAACAAG TTTCACAACG GGTGACTGCT GATCGATTTC TGGAGCATGC CATTACCTTT
GCGCTCATCC AGATCAATCA TGCAGAATCG ACTCGTGCCG GCTTGTTGTC TCATTCTGCG
GATGTGCAGC GAATCACTCT GATCGCACTC GATCAAATGC CATTAACAAT ACTTGATGCG
AAAGATCTGA CGCCACTTTT AAGTTCTGGC AATACACCTC TCAGACAGGC GGCCATCCAG
GTTTTATCAC GACATCCCCA ATGGACTGCT GAATCCATCG CGCTGATTGA TGAATGGTTC
AAAAGTGACC AGATCAATGA AGATCGAGTC CAGATCATCG CAGGATTTGT GCGCACACTT
CAGCATGAGC CACAAATGCA GGAAACCATC AGCCGGAATT TTCAAGAACA GCAGTTGCGC
TCGAAAACAT CACGTCGAGC ACTGCTTATT GCCGTTTCAA AACTTGAAAA AGCGAACATA
CCTCAAGGAT GGCTTAACGG AATTGAGGAG TCGCTCGCAG CGGAGGATAC AGAGATATGC
ATGGCAGCGC TTGATGCGGT CGGGAAACTT TCGCTGCTAC CTTTGGAGAG ATCAATACGC
CGGATAGCTC AAGAACCTGC CAGGGAGCCA CAACTTCGAC TGAAAGCGCT ACGAACATTA
ACGACTCTCG ACAAGAGACT CAGTGACTCT GAATTTGAGT ATCTCGTTTC AAGACTTTCG
GCTGAGACAC CCATCCGGGA ACGCATCACG GCACTCGACG TTATTGCCAA TACTACACAC
GATGAACAGC GATTGTCTCA ATTGCTTCCA TTCGTGAAAG TTGCCAATCC GGTGGAGTTG
CCCTATCTAT TGGCTGCCTA TGTGAATTGC ACGAACAAAG AGATAATCGA ACAGTTAGTT
GAGGCCCTTG AAGTCTCTTC AGCCACTCCA ACACTCGACA TGATTGAGCA AATTCTGAAA
CCACACGGAG AGGAAGTACA ACGTGATGCG GCACCTCTTC TTGAGCGACT GAGAAAATTG
AAGAATGACC AACTCATTCG ACTGAACGAA TGGGAACAGC GAATCGAGGG ACATAAGGGT
GACAGCGAGC GAGGTCGATT GCTCTTTATG AACAAGGCTC AATGCCATTT GTGTCACGTA
ACAGATTCCC AGGATAAAGT GAACTCTCCA GCGAAGATCG GGCCTGATCT TGCTGCCATT
GGCGAAATTC GCACCCGGCG TGAACTTCTC GAAGCGATTC TTTTTCCCAG TGCGAGTTTT
GCCCGGGGGT TCGAACCGAT TGTTGTTACC TTACAGGACG GACGAGTCTG GACAGGTTTG
GCAGGGAAAG AAACGACAGA GGAGTTCATC CTCACCACGA TTCAGGACAA TAAACCCGTC
GAAAAGATGA TTCGACGCAA CGAAATCGAA GAAGTCGCTG TGGGCAGGGT TTCCGCAATG
CCCAACGGGC TCGAGCAGCC GCTCACAGCA CAAGAGTTTG CCGACCTCAT GACGTTCCTA
CAGAACTTAA GAGCTTCCAA AGTGACGAAA ACTGTCACAG AAGGTGTTGC CAATTAA
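
The gene sequence above is printed in blocks of 10 bases, 6 blocks per line. A minimal sketch of how such a listing might be joined into one string and checked against the record's length and GC fields follows. The helper names are hypothetical, and only the first and last printed lines are copied in to keep the example short.

```python
def clean_sequence(text):
    """Join a block-formatted listing into one uppercase string without whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the listing.
FIRST_LINE = "ATGCAGCGCA TAGAGTTTGA CGTTCTTCTG GCACCGTGCC TGACAGTTCT TCTGATGATT"
LAST_LINE = "CAGAACTTAA GAGCTTCCAA AGTGACGAAA ACTGTCACAG AAGGTGTTGC CAATTAA"

if __name__ == "__main__":
    # The listing has 52 full lines of 60 bases followed by one shorter line.
    total = 52 * len(clean_sequence(FIRST_LINE)) + len(clean_sequence(LAST_LINE))
    print(total)  # compare with the Gene Length field
    print(round(gc_fraction(clean_sequence(FIRST_LINE)), 2))  # GC of the first line only
```

Passing the whole listing through `clean_sequence` and then `gc_fraction` would give the figure to compare with the GC content field.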
 
Protein sequence
MQRIEFDVLL APCLTVLLMI QPAWAESERY SITVPEGFDI RQAASFPLVE RPMFAALDQS 
GHLYVLDSGG SNGGDRLKNP TDVIRRLTDT NGDGIYDQST IFADKIVFGT GIACHDGAVF
ITSPPSLWRF EDTTGDGIAD QRVELVTGFA FNQSCTDDLH GTTVGPDGRI YFLPGRFHHK
VRLKDGTPLR DGVGPWLMRC RPDGSDVEFV SGAVGNPVEV AFLPNGDSFI QGTFWAKSSA
PGGLRDGLIH AVAGGEYSVR DRDYSDRIRT GDYLPALVPL TATAPSGLTS YRSSSWGDEF
QENLFSSHFN TGKILRHRLK AESATYRCET EEFITAPQGT VHFTDVLEDA DGSLLIVDTG
GWFIACCPAS GSSQPTVKGS IFRILRNAAT KVQDPYGNLI PWKSLPTDDL LARLDDSRVM
VQERAIIEVA RREQRMTDAL ATLLTSPQSS FRQRTGAVWA LCRMDDNAAR AATRLAFRDP
SHSVRQAAAY SAGLHRDRSA RQTLEALLVD ESSGVRREAA NALGRLQQKE SIPALLKSLE
PQQVSQRVTA DRFLEHAITF ALIQINHAES TRAGLLSHSA DVQRITLIAL DQMPLTILDA
KDLTPLLSSG NTPLRQAAIQ VLSRHPQWTA ESIALIDEWF KSDQINEDRV QIIAGFVRTL
QHEPQMQETI SRNFQEQQLR SKTSRRALLI AVSKLEKANI PQGWLNGIEE SLAAEDTEIC
MAALDAVGKL SLLPLERSIR RIAQEPAREP QLRLKALRTL TTLDKRLSDS EFEYLVSRLS
AETPIRERIT ALDVIANTTH DEQRLSQLLP FVKVANPVEL PYLLAAYVNC TNKEIIEQLV
EALEVSSATP TLDMIEQILK PHGEEVQRDA APLLERLRKL KNDQLIRLNE WEQRIEGHKG
DSERGRLLFM NKAQCHLCHV TDSQDKVNSP AKIGPDLAAI GEIRTRRELL EAILFPSASF
ARGFEPIVVT LQDGRVWTGL AGKETTEEFI LTTIQDNKPV EKMIRRNEIE EVAVGRVSAM
PNGLEQPLTA QEFADLMTFL QNLRASKVTK TVTEGVAN