Gene Plim_1723 details


Gene Information

Locus tag: Plim_1723
Symbol:
ID: 9138424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 2236578
End bp: 2239811
Gene Length: 3234 bp
Protein Length: 1077 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: heme-binding protein
Protein accession: YP_003629752
Protein GI: 296121974
COG category:
COG ID:
TIGRFAM ID:
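The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2236578
END_BP = 2239811
GENE_LENGTH_BP = 3234
PROTEIN_LENGTH_AA = 1077


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2239811 - 2236578 + 1 gives 3234 bp, and 3234 / 3 - 1 gives 1077 aa, matching the listed values.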


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.789867
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCGGC ATCAAGTTCT CGCCAATCCT TGGCATCGTT TTCGGTTGGT CTGTTCTTGC 
AGCCTGTTGG TGCTGTGCGG ATCGAACATC TCTCTTCAAG CTGAGGACCC TGCTGAGAAA
AAATGGTCGC AGCCTGTCGA ACGCATGCCC GTCAATGGCC GGGAGCTGAC TGCCGGTACC
GATTCAAAGT TTCCGGCTGA GAATGCTCCT TGGATTTGGG GCCCATCGTA CGACTCACCT
TATGTACTCA AGAAGTCGTG GGTTGTTCCC GAAGGTCTCG TGGCTGCGCA GCTTGTCGCC
ACGTGCGATA ACGAGATGGA GTTGTTCCTG AATGGGAAGT CGATTGGTTC GAGCAACGAA
TGGCAAACTC CCATCACCAT CCCTCTCACA GGTAAGCTCG CCAAGGGGGA AAATGTTCTG
ACGGCGAAAG TCAGCAATGA AGGCGGGATC GCGGCGTTTG CCTGCCGCCT GAGTATGAAA
GATGCCCAGG GAAAAGTCTC CACTATCGAG AGCGACGAGA GCTGGCAGGC TTTTTCAAGC
GACGATCTTC CGAAACAGCA CCCCATCAAG CTGGTGGCCA AGCCGGGAGA AGGCCCATGG
GGCCAGGTGA TGACAAATGC GAACGAAGTC AGTCCCGCAG CGAAGAGTTT TTCAGTCCCC
AGCGGCTTTG AAGTCGAACG CCTGTTTGTG GTCCCCCGGG ATGAACTGGG TTCATGGGTG
GCGATTACTT CGGATCCTAA GGGTCGGCTG ATTGCCAGTG ATCAAGGTGG CAAAGGCCTG
GTGCGAATTA CACCAGCCCC TCTCGATGGC ACCGGTGAAA CCATCGTTGA AAAAATTCCA
GTCGAACTTT CGGGAGCACA AGGGCTTCTC TGGGCTTTTG ATGCTCTCTA CGTGGTGTGC
AATGGTGGGC CAGGCAGCGG GCTCTATCGA GCCACCGACA GCAACGGAGA TGATGTTCTC
GACAAAGTTG AGAAACTTCG CGATCTGCAA GGTGGTGGTG AGCATGGCCC GCATAGCATT
GTGCTTTCGC CCGACGGTCA AAAGCTGTTT GTGATTTGCG GCAATCACAC CAAAGTTCCG
TTCAACGTCA AAGATCTTAC CCCGCCGCAA ACGATGGGGG GGATTCGTAC TGAGCAGCGA
CGCGTCGAAG TCGCGGGAGA TGGTGCCAGT CGATTGCCTG CCAACTGGGA TGAAGATCAG
ATCATCACTC GCATGTGGGA TGCCAATGGG CATGCCGCCG GGATTCTGGC TCCCGGTGGA
TATGTCGTTT CTACAGACAA AGACGGCAAA AGCTGGGAAG TCTGGAGTGC GGGTTATCGC
AATCCCTACG ACATGGCTTT CAACACTGAT GGTGAATTGT TTGTCTACGA TGCCGACATG
GAGTGGGATT TTGGCACTCC CTGGTATCGG CCCACACGCG TCAACCATGC CACCAGCGGC
AGTGAACTGG GGTGGCGCAG TGGCAGTGCG AAATGGCCTG CGATTTTTCC AGACAGTTTA
CCCGCTCTGT ATGACATTGG TCCCGGTTCA CCGGTCGGTG TGACGTTTGG ATATGGCACT
CGCTTCCCGG CCAAATACCA GCAGGCGCTG TACCTTTGTG ATTGGACGTT CGGGACGATG
TACGCCATTC ATCTCACTCC GGAAGGTTCA AGTTACCGTG CCACGCGTGA GGAGTTTGTC
TCTCGCACAC CTCTCCCACT GACAGATGTC ACGATTGGTC GTGATGGAGC GATGTACTTC
ACGGTAGGTG GACGTGGCGG GCAGGGTGAA CTTTATCGTG TGCGCTACAG GGGAAATGAA
TCCACACAGC CCGTCATGGC AAAATCCGAA GAGGGGGCCG CGCTCCGTTC TGTGAGACGC
GAACTCGAAA GTTTCCACAC ATCTGCGGCC AATCCGGATC AAGCAATTCC CAAAGCTTTG
GCAAATCTAG GGCATGAAGA CCGGTACATT CGCTATGCAG CCCGAGTAGC ACTTGAGCAT
CAACCTGTCG CTCAATGGAA AGAAAAGGCT CTTGCCTCCA ATTCTCCACT TGCTCTGATC
GAAGGGGCCA TCGCGTTGGC CCATCAGACA GATCCTTCCG ATCAGCCAGC GATCCTGAAA
GCTCTTGACC AGATTGATAT TGATAAGCTT TCAGTCACCC AGAAGGTCTG GCTGCTCAGA
GCCTACGAAT TGGCGATGAT TCGACTGGGC GAGCCCTCAG CCGAGTTTAA GAAGAGTTTC
GCCGCCCGAT GGAATCCCCA ATTCCCCAGT GGAGAGTTCG ATCTCGACCG GCAACTCTCC
TCGATGCTGG TCGCTGTCAG AGCCCCGGGA ATTGTCACCA AACTGGTCAG TCTCCTCTCA
GAACAATCCA GTTCGCGTGG GCGTCCCACC AATCTGGCAC CTGATGAAAA TGCACTCAAA
GAGTTGATCA CCCGTAATGC TGGCTATGGA AGTGCCGTGC GGGCATCTCT CGAACGCGGT
GGTGACCTGT TACAGATTCA TTACGCCTAT GCCTTGCGAA CCATTCATGA CCGGGATGCC
TGGACGATTG ATGATCGCAA GGGATATCAC GGCTGGTTCC AGCGGGCTCG TGAATGGGCC
GGTGGCAACA GTTTCCGCAA GTTTCTAGTC AACATGGAGA ACGAGAGCCT TACGGGGCTC
TCTGAAAACG AAAAACTGGC ACTGGAAGTT CTCGGTGCCC GTAAGCCTTA CACACCACCC
CCTCTGCCAA AGCCGATGGG CCCTGGTAAA GCCTGGACAC AGGACGAGGT GATGGCTCTG
GTGACGAGTG GCCGACTCGA TCGAGGTCGA AACTTCGAAA AAGGCAAACG TGCCTTTGCG
GCCGCACGTT GTATTGTCTG TCATCGCTTT GGTGAAGACG GTGGAGCCAC CGGGCCCGAT
ATGACACAGG TCGCTGGCCG ATTCCAACTC AAGGATCTTG TCGAAGCGAT TGTCGAACCC
AGCAAGGTTG TTTCGGATCA ATACAAAGCC AGCGTAGTGG AGACAGCCGA CGGTCGCTCA
CTGGTCGGGC GGATTGTGCA TGAATCGCCG ACGTCGATTC TGCTGGTGAC GGACCCCGAA
GATGCGACCA AGTTTGTCGA ACTGCAGAGG AAAGATATTG AGTCAATTGC CCCAGCACAG
GAATCGCTGA TGCCTAAGGG ATTACTGAGC ACTCTCAATG AGGAGGAGCT ACTGGATCTG
CTGGCGTACT CGATTTCTCG AAACAATCCG CGAGACGCGA GATTCAAAAA ATAG
 
Protein sequence
MSRHQVLANP WHRFRLVCSC SLLVLCGSNI SLQAEDPAEK KWSQPVERMP VNGRELTAGT 
DSKFPAENAP WIWGPSYDSP YVLKKSWVVP EGLVAAQLVA TCDNEMELFL NGKSIGSSNE
WQTPITIPLT GKLAKGENVL TAKVSNEGGI AAFACRLSMK DAQGKVSTIE SDESWQAFSS
DDLPKQHPIK LVAKPGEGPW GQVMTNANEV SPAAKSFSVP SGFEVERLFV VPRDELGSWV
AITSDPKGRL IASDQGGKGL VRITPAPLDG TGETIVEKIP VELSGAQGLL WAFDALYVVC
NGGPGSGLYR ATDSNGDDVL DKVEKLRDLQ GGGEHGPHSI VLSPDGQKLF VICGNHTKVP
FNVKDLTPPQ TMGGIRTEQR RVEVAGDGAS RLPANWDEDQ IITRMWDANG HAAGILAPGG
YVVSTDKDGK SWEVWSAGYR NPYDMAFNTD GELFVYDADM EWDFGTPWYR PTRVNHATSG
SELGWRSGSA KWPAIFPDSL PALYDIGPGS PVGVTFGYGT RFPAKYQQAL YLCDWTFGTM
YAIHLTPEGS SYRATREEFV SRTPLPLTDV TIGRDGAMYF TVGGRGGQGE LYRVRYRGNE
STQPVMAKSE EGAALRSVRR ELESFHTSAA NPDQAIPKAL ANLGHEDRYI RYAARVALEH
QPVAQWKEKA LASNSPLALI EGAIALAHQT DPSDQPAILK ALDQIDIDKL SVTQKVWLLR
AYELAMIRLG EPSAEFKKSF AARWNPQFPS GEFDLDRQLS SMLVAVRAPG IVTKLVSLLS
EQSSSRGRPT NLAPDENALK ELITRNAGYG SAVRASLERG GDLLQIHYAY ALRTIHDRDA
WTIDDRKGYH GWFQRAREWA GGNSFRKFLV NMENESLTGL SENEKLALEV LGARKPYTPP
PLPKPMGPGK AWTQDEVMAL VTSGRLDRGR NFEKGKRAFA AARCIVCHRF GEDGGATGPD
MTQVAGRFQL KDLVEAIVEP SKVVSDQYKA SVVETADGRS LVGRIVHESP TSILLVTDPE
DATKFVELQR KDIESIAPAQ ESLMPKGLLS TLNEEELLDL LAYSISRNNP RDARFKK
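
The sequences above are printed in blocks of 10 characters, so they need whitespace removed before use. The following is a minimal sketch of how the gene sequence could be cleaned, summarised by GC fraction, and translated for comparison with the protein sequence. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code, and it ignores alternative start codons. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers written for this example.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 up to the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence shown above.
    first_line = (
        "ATGAGTCGGC ATCAAGTTCT CGCCAATCCT TGGCATCGTT TTCGGTTGGT CTGTTCTTGC"
    )
    print(translate(first_line))  # MSRHQVLANPWHRFRLVCSC
```

The 20 residues printed for the first 60 bases match the start of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field.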