Gene Information |
Locus tag | Plim_1723 |
Symbol | |
ID | 9138424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | - |
Start bp | 2236578 |
End bp | 2239811 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | heme-binding protein |
Protein accession | YP_003629752 |
Protein GI | 296121974 |
COG category | |
COG ID | |
TIGRFAM ID | |
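The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Both assumptions are conventional but are not stated in the record itself. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 2236578
end_bp = 2239811
gene_length_bp = 3234
protein_length_aa = 1077

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is part of the
# gene length but does not contribute an amino acid.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: the span is 3234 bp, which is 1078 codons, or 1077 residues plus one stop codon.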
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.789867 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCGGC ATCAAGTTCT CGCCAATCCT TGGCATCGTT TTCGGTTGGT CTGTTCTTGC AGCCTGTTGG TGCTGTGCGG ATCGAACATC TCTCTTCAAG CTGAGGACCC TGCTGAGAAA AAATGGTCGC AGCCTGTCGA ACGCATGCCC GTCAATGGCC GGGAGCTGAC TGCCGGTACC GATTCAAAGT TTCCGGCTGA GAATGCTCCT TGGATTTGGG GCCCATCGTA CGACTCACCT TATGTACTCA AGAAGTCGTG GGTTGTTCCC GAAGGTCTCG TGGCTGCGCA GCTTGTCGCC ACGTGCGATA ACGAGATGGA GTTGTTCCTG AATGGGAAGT CGATTGGTTC GAGCAACGAA TGGCAAACTC CCATCACCAT CCCTCTCACA GGTAAGCTCG CCAAGGGGGA AAATGTTCTG ACGGCGAAAG TCAGCAATGA AGGCGGGATC GCGGCGTTTG CCTGCCGCCT GAGTATGAAA GATGCCCAGG GAAAAGTCTC CACTATCGAG AGCGACGAGA GCTGGCAGGC TTTTTCAAGC GACGATCTTC CGAAACAGCA CCCCATCAAG CTGGTGGCCA AGCCGGGAGA AGGCCCATGG GGCCAGGTGA TGACAAATGC GAACGAAGTC AGTCCCGCAG CGAAGAGTTT TTCAGTCCCC AGCGGCTTTG AAGTCGAACG CCTGTTTGTG GTCCCCCGGG ATGAACTGGG TTCATGGGTG GCGATTACTT CGGATCCTAA GGGTCGGCTG ATTGCCAGTG ATCAAGGTGG CAAAGGCCTG GTGCGAATTA CACCAGCCCC TCTCGATGGC ACCGGTGAAA CCATCGTTGA AAAAATTCCA GTCGAACTTT CGGGAGCACA AGGGCTTCTC TGGGCTTTTG ATGCTCTCTA CGTGGTGTGC AATGGTGGGC CAGGCAGCGG GCTCTATCGA GCCACCGACA GCAACGGAGA TGATGTTCTC GACAAAGTTG AGAAACTTCG CGATCTGCAA GGTGGTGGTG AGCATGGCCC GCATAGCATT GTGCTTTCGC CCGACGGTCA AAAGCTGTTT GTGATTTGCG GCAATCACAC CAAAGTTCCG TTCAACGTCA AAGATCTTAC CCCGCCGCAA ACGATGGGGG GGATTCGTAC TGAGCAGCGA CGCGTCGAAG TCGCGGGAGA TGGTGCCAGT CGATTGCCTG CCAACTGGGA TGAAGATCAG ATCATCACTC GCATGTGGGA TGCCAATGGG CATGCCGCCG GGATTCTGGC TCCCGGTGGA TATGTCGTTT CTACAGACAA AGACGGCAAA AGCTGGGAAG TCTGGAGTGC GGGTTATCGC AATCCCTACG ACATGGCTTT CAACACTGAT GGTGAATTGT TTGTCTACGA TGCCGACATG GAGTGGGATT TTGGCACTCC CTGGTATCGG CCCACACGCG TCAACCATGC CACCAGCGGC AGTGAACTGG GGTGGCGCAG TGGCAGTGCG AAATGGCCTG CGATTTTTCC AGACAGTTTA CCCGCTCTGT ATGACATTGG TCCCGGTTCA CCGGTCGGTG TGACGTTTGG ATATGGCACT CGCTTCCCGG CCAAATACCA GCAGGCGCTG TACCTTTGTG ATTGGACGTT CGGGACGATG TACGCCATTC ATCTCACTCC GGAAGGTTCA AGTTACCGTG CCACGCGTGA GGAGTTTGTC TCTCGCACAC CTCTCCCACT GACAGATGTC ACGATTGGTC GTGATGGAGC GATGTACTTC ACGGTAGGTG GACGTGGCGG GCAGGGTGAA CTTTATCGTG TGCGCTACAG GGGAAATGAA 
TCCACACAGC CCGTCATGGC AAAATCCGAA GAGGGGGCCG CGCTCCGTTC TGTGAGACGC GAACTCGAAA GTTTCCACAC ATCTGCGGCC AATCCGGATC AAGCAATTCC CAAAGCTTTG GCAAATCTAG GGCATGAAGA CCGGTACATT CGCTATGCAG CCCGAGTAGC ACTTGAGCAT CAACCTGTCG CTCAATGGAA AGAAAAGGCT CTTGCCTCCA ATTCTCCACT TGCTCTGATC GAAGGGGCCA TCGCGTTGGC CCATCAGACA GATCCTTCCG ATCAGCCAGC GATCCTGAAA GCTCTTGACC AGATTGATAT TGATAAGCTT TCAGTCACCC AGAAGGTCTG GCTGCTCAGA GCCTACGAAT TGGCGATGAT TCGACTGGGC GAGCCCTCAG CCGAGTTTAA GAAGAGTTTC GCCGCCCGAT GGAATCCCCA ATTCCCCAGT GGAGAGTTCG ATCTCGACCG GCAACTCTCC TCGATGCTGG TCGCTGTCAG AGCCCCGGGA ATTGTCACCA AACTGGTCAG TCTCCTCTCA GAACAATCCA GTTCGCGTGG GCGTCCCACC AATCTGGCAC CTGATGAAAA TGCACTCAAA GAGTTGATCA CCCGTAATGC TGGCTATGGA AGTGCCGTGC GGGCATCTCT CGAACGCGGT GGTGACCTGT TACAGATTCA TTACGCCTAT GCCTTGCGAA CCATTCATGA CCGGGATGCC TGGACGATTG ATGATCGCAA GGGATATCAC GGCTGGTTCC AGCGGGCTCG TGAATGGGCC GGTGGCAACA GTTTCCGCAA GTTTCTAGTC AACATGGAGA ACGAGAGCCT TACGGGGCTC TCTGAAAACG AAAAACTGGC ACTGGAAGTT CTCGGTGCCC GTAAGCCTTA CACACCACCC CCTCTGCCAA AGCCGATGGG CCCTGGTAAA GCCTGGACAC AGGACGAGGT GATGGCTCTG GTGACGAGTG GCCGACTCGA TCGAGGTCGA AACTTCGAAA AAGGCAAACG TGCCTTTGCG GCCGCACGTT GTATTGTCTG TCATCGCTTT GGTGAAGACG GTGGAGCCAC CGGGCCCGAT ATGACACAGG TCGCTGGCCG ATTCCAACTC AAGGATCTTG TCGAAGCGAT TGTCGAACCC AGCAAGGTTG TTTCGGATCA ATACAAAGCC AGCGTAGTGG AGACAGCCGA CGGTCGCTCA CTGGTCGGGC GGATTGTGCA TGAATCGCCG ACGTCGATTC TGCTGGTGAC GGACCCCGAA GATGCGACCA AGTTTGTCGA ACTGCAGAGG AAAGATATTG AGTCAATTGC CCCAGCACAG GAATCGCTGA TGCCTAAGGG ATTACTGAGC ACTCTCAATG AGGAGGAGCT ACTGGATCTG CTGGCGTACT CGATTTCTCG AAACAATCCG CGAGACGCGA GATTCAAAAA ATAG
|
Protein sequence | MSRHQVLANP WHRFRLVCSC SLLVLCGSNI SLQAEDPAEK KWSQPVERMP VNGRELTAGT DSKFPAENAP WIWGPSYDSP YVLKKSWVVP EGLVAAQLVA TCDNEMELFL NGKSIGSSNE WQTPITIPLT GKLAKGENVL TAKVSNEGGI AAFACRLSMK DAQGKVSTIE SDESWQAFSS DDLPKQHPIK LVAKPGEGPW GQVMTNANEV SPAAKSFSVP SGFEVERLFV VPRDELGSWV AITSDPKGRL IASDQGGKGL VRITPAPLDG TGETIVEKIP VELSGAQGLL WAFDALYVVC NGGPGSGLYR ATDSNGDDVL DKVEKLRDLQ GGGEHGPHSI VLSPDGQKLF VICGNHTKVP FNVKDLTPPQ TMGGIRTEQR RVEVAGDGAS RLPANWDEDQ IITRMWDANG HAAGILAPGG YVVSTDKDGK SWEVWSAGYR NPYDMAFNTD GELFVYDADM EWDFGTPWYR PTRVNHATSG SELGWRSGSA KWPAIFPDSL PALYDIGPGS PVGVTFGYGT RFPAKYQQAL YLCDWTFGTM YAIHLTPEGS SYRATREEFV SRTPLPLTDV TIGRDGAMYF TVGGRGGQGE LYRVRYRGNE STQPVMAKSE EGAALRSVRR ELESFHTSAA NPDQAIPKAL ANLGHEDRYI RYAARVALEH QPVAQWKEKA LASNSPLALI EGAIALAHQT DPSDQPAILK ALDQIDIDKL SVTQKVWLLR AYELAMIRLG EPSAEFKKSF AARWNPQFPS GEFDLDRQLS SMLVAVRAPG IVTKLVSLLS EQSSSRGRPT NLAPDENALK ELITRNAGYG SAVRASLERG GDLLQIHYAY ALRTIHDRDA WTIDDRKGYH GWFQRAREWA GGNSFRKFLV NMENESLTGL SENEKLALEV LGARKPYTPP PLPKPMGPGK AWTQDEVMAL VTSGRLDRGR NFEKGKRAFA AARCIVCHRF GEDGGATGPD MTQVAGRFQL KDLVEAIVEP SKVVSDQYKA SVVETADGRS LVGRIVHESP TSILLVTDPE DATKFVELQR KDIESIAPAQ ESLMPKGLLS TLNEEELLDL LAYSISRNNP RDARFKK
|
| |
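The sequences above are printed in space-separated blocks of ten characters. Before any downstream use they need to be joined into one contiguous string. The sketch below shows one way to do that and to compute a GC fraction. The helper names `normalize` and `gc_fraction` are hypothetical, and the excerpt is only the first three blocks of the gene sequence, so its GC fraction need not match the 55% reported for the full gene.

```python
def normalize(blocked: str) -> str:
    """Join a block-formatted sequence into one upper-case string."""
    # str.split() with no arguments drops spaces, tabs and newlines.
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence in this record.
excerpt = "ATGAGTCGGC ATCAAGTTCT CGCCAATCCT"
seq = normalize(excerpt)

print(seq)
print(len(seq), gc_fraction(seq))
```

The same `normalize` call works on the protein sequence, since it only strips whitespace and changes case.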