Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_1436 |
Symbol | |
ID | 9138131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | - |
Start bp | 1840833 |
End bp | 1844009 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | heme-binding protein |
Protein accession | YP_003629469 |
Protein GI | 296121691 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGCA TAGAGTTTGA CGTTCTTCTG GCACCGTGCC TGACAGTTCT TCTGATGATT CAACCGGCCT GGGCTGAATC CGAACGCTAT TCCATCACAG TGCCCGAAGG CTTTGACATT CGACAGGCGG CCAGTTTTCC GCTCGTCGAA CGCCCAATGT TTGCAGCACT TGATCAGTCA GGACATCTCT ATGTTCTTGA TTCGGGTGGA TCGAATGGTG GAGATCGATT GAAAAACCCC ACCGATGTCA TTCGTCGCCT GACGGATACC AATGGCGACG GTATCTACGA TCAAAGCACA ATTTTTGCTG ATAAAATCGT GTTTGGCACA GGGATTGCCT GCCACGATGG AGCCGTATTC ATCACATCTC CACCCAGCCT CTGGAGGTTT GAAGATACCA CTGGCGACGG AATTGCCGAT CAGCGTGTAG AGCTCGTGAC TGGATTTGCT TTCAATCAAA GTTGTACTGA TGATCTGCAT GGCACGACTG TGGGGCCAGA TGGTCGAATC TATTTTTTAC CTGGTCGATT TCACCATAAA GTCCGCCTGA AGGACGGTAC GCCTCTACGA GATGGTGTTG GCCCGTGGCT GATGCGCTGC CGACCGGATG GAAGTGATGT CGAGTTTGTC TCTGGTGCAG TAGGAAATCC CGTCGAAGTG GCTTTTCTCC CCAATGGTGA TTCATTTATT CAGGGGACAT TCTGGGCGAA ATCCTCAGCG CCCGGTGGAC TGAGGGATGG CCTGATTCAC GCTGTCGCAG GCGGTGAGTA CTCGGTTCGA GATCGCGACT ATTCCGACCG AATACGAACC GGGGATTATC TTCCCGCACT TGTCCCTCTA ACGGCGACAG CCCCCAGTGG CTTGACGAGT TATCGGAGCT CCTCCTGGGG AGACGAGTTT CAAGAGAACC TGTTTTCTTC ACACTTTAAT ACAGGCAAGA TTCTGCGTCA TCGACTCAAG GCTGAAAGTG CAACATATCG TTGCGAAACG GAAGAGTTTA TCACAGCACC ACAAGGAACC GTTCATTTCA CTGATGTTCT GGAAGATGCT GATGGAAGTC TGTTGATTGT CGATACGGGA GGTTGGTTTA TTGCCTGTTG CCCTGCCTCC GGTTCGAGCC AGCCAACAGT CAAAGGATCA ATTTTTCGAA TCCTTCGCAA TGCCGCGACG AAGGTTCAGG ATCCCTATGG CAATCTCATT CCATGGAAGT CTTTACCTAC TGACGATCTG TTGGCTCGGC TTGATGATTC GCGGGTCATG GTGCAGGAGC GTGCCATCAT CGAAGTGGCT CGACGAGAAC AACGGATGAC CGACGCCTTG GCAACTCTTC TGACATCGCC ACAATCCAGT TTCAGGCAGC GAACAGGAGC GGTTTGGGCG CTTTGCCGCA TGGATGACAA TGCGGCACGG GCAGCGACTC GGCTGGCCTT CAGAGATCCC AGCCACAGTG TACGACAAGC AGCTGCCTAC TCGGCAGGTC TTCACCGTGA TCGATCGGCA CGTCAAACGC TGGAAGCGCT ACTCGTTGAT GAATCATCAG GAGTTCGACG AGAAGCTGCC AATGCCCTGG GCAGACTTCA ACAGAAGGAA TCCATCCCGG CTCTTCTCAA AAGCCTAGAA CCACAACAAG TTTCACAACG GGTGACTGCT GATCGATTTC TGGAGCATGC CATTACCTTT GCGCTCATCC AGATCAATCA TGCAGAATCG ACTCGTGCCG GCTTGTTGTC TCATTCTGCG GATGTGCAGC GAATCACTCT GATCGCACTC GATCAAATGC CATTAACAAT ACTTGATGCG AAAGATCTGA CGCCACTTTT AAGTTCTGGC AATACACCTC TCAGACAGGC GGCCATCCAG GTTTTATCAC GACATCCCCA ATGGACTGCT GAATCCATCG CGCTGATTGA TGAATGGTTC AAAAGTGACC AGATCAATGA AGATCGAGTC CAGATCATCG CAGGATTTGT GCGCACACTT CAGCATGAGC CACAAATGCA GGAAACCATC AGCCGGAATT TTCAAGAACA GCAGTTGCGC TCGAAAACAT CACGTCGAGC ACTGCTTATT GCCGTTTCAA AACTTGAAAA AGCGAACATA CCTCAAGGAT GGCTTAACGG AATTGAGGAG TCGCTCGCAG CGGAGGATAC AGAGATATGC ATGGCAGCGC TTGATGCGGT CGGGAAACTT TCGCTGCTAC CTTTGGAGAG ATCAATACGC CGGATAGCTC AAGAACCTGC CAGGGAGCCA CAACTTCGAC TGAAAGCGCT ACGAACATTA ACGACTCTCG ACAAGAGACT CAGTGACTCT GAATTTGAGT ATCTCGTTTC AAGACTTTCG GCTGAGACAC CCATCCGGGA ACGCATCACG GCACTCGACG TTATTGCCAA TACTACACAC GATGAACAGC GATTGTCTCA ATTGCTTCCA TTCGTGAAAG TTGCCAATCC GGTGGAGTTG CCCTATCTAT TGGCTGCCTA TGTGAATTGC ACGAACAAAG AGATAATCGA ACAGTTAGTT GAGGCCCTTG AAGTCTCTTC AGCCACTCCA ACACTCGACA TGATTGAGCA AATTCTGAAA CCACACGGAG AGGAAGTACA ACGTGATGCG GCACCTCTTC TTGAGCGACT GAGAAAATTG AAGAATGACC AACTCATTCG ACTGAACGAA TGGGAACAGC GAATCGAGGG ACATAAGGGT GACAGCGAGC GAGGTCGATT GCTCTTTATG AACAAGGCTC AATGCCATTT GTGTCACGTA ACAGATTCCC AGGATAAAGT GAACTCTCCA GCGAAGATCG GGCCTGATCT TGCTGCCATT GGCGAAATTC GCACCCGGCG TGAACTTCTC GAAGCGATTC TTTTTCCCAG TGCGAGTTTT GCCCGGGGGT TCGAACCGAT TGTTGTTACC TTACAGGACG GACGAGTCTG GACAGGTTTG GCAGGGAAAG AAACGACAGA GGAGTTCATC CTCACCACGA TTCAGGACAA TAAACCCGTC GAAAAGATGA TTCGACGCAA CGAAATCGAA GAAGTCGCTG TGGGCAGGGT TTCCGCAATG CCCAACGGGC TCGAGCAGCC GCTCACAGCA CAAGAGTTTG CCGACCTCAT GACGTTCCTA CAGAACTTAA GAGCTTCCAA AGTGACGAAA ACTGTCACAG AAGGTGTTGC CAATTAA
|
Protein sequence | MQRIEFDVLL APCLTVLLMI QPAWAESERY SITVPEGFDI RQAASFPLVE RPMFAALDQS GHLYVLDSGG SNGGDRLKNP TDVIRRLTDT NGDGIYDQST IFADKIVFGT GIACHDGAVF ITSPPSLWRF EDTTGDGIAD QRVELVTGFA FNQSCTDDLH GTTVGPDGRI YFLPGRFHHK VRLKDGTPLR DGVGPWLMRC RPDGSDVEFV SGAVGNPVEV AFLPNGDSFI QGTFWAKSSA PGGLRDGLIH AVAGGEYSVR DRDYSDRIRT GDYLPALVPL TATAPSGLTS YRSSSWGDEF QENLFSSHFN TGKILRHRLK AESATYRCET EEFITAPQGT VHFTDVLEDA DGSLLIVDTG GWFIACCPAS GSSQPTVKGS IFRILRNAAT KVQDPYGNLI PWKSLPTDDL LARLDDSRVM VQERAIIEVA RREQRMTDAL ATLLTSPQSS FRQRTGAVWA LCRMDDNAAR AATRLAFRDP SHSVRQAAAY SAGLHRDRSA RQTLEALLVD ESSGVRREAA NALGRLQQKE SIPALLKSLE PQQVSQRVTA DRFLEHAITF ALIQINHAES TRAGLLSHSA DVQRITLIAL DQMPLTILDA KDLTPLLSSG NTPLRQAAIQ VLSRHPQWTA ESIALIDEWF KSDQINEDRV QIIAGFVRTL QHEPQMQETI SRNFQEQQLR SKTSRRALLI AVSKLEKANI PQGWLNGIEE SLAAEDTEIC MAALDAVGKL SLLPLERSIR RIAQEPAREP QLRLKALRTL TTLDKRLSDS EFEYLVSRLS AETPIRERIT ALDVIANTTH DEQRLSQLLP FVKVANPVEL PYLLAAYVNC TNKEIIEQLV EALEVSSATP TLDMIEQILK PHGEEVQRDA APLLERLRKL KNDQLIRLNE WEQRIEGHKG DSERGRLLFM NKAQCHLCHV TDSQDKVNSP AKIGPDLAAI GEIRTRRELL EAILFPSASF ARGFEPIVVT LQDGRVWTGL AGKETTEEFI LTTIQDNKPV EKMIRRNEIE EVAVGRVSAM PNGLEQPLTA QEFADLMTFL QNLRASKVTK TVTEGVAN
|
| |