Gene Information |
Locus tag | Cphamn1_0735 |
Symbol | |
ID | 6374400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 775055 |
End bp | 778423 |
Gene Length | 3369 bp |
Protein Length | 1122 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642683244 |
Product | hypothetical protein |
Protein accession | YP_001959170 |
Protein GI | 189499700 |
COG category | [S] Function unknown |
COG ID | [COG4913] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01633] putative phage tail component, N-terminal domain |
|
|
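The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but contributes no amino acid.
    return codons - 1 if includes_stop_codon else codons


# Values taken from the record for Cphamn1_0735.
length = gene_length_bp(775055, 778423)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (3369 bp) and Protein Length (1122 aa). The strand does not affect the length calculation, only the orientation of the reported sequence.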
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00640704 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGAAG CCCTTGAATT CGGATTCATC TCAGACGACC GCCTCGCCGG CTTCCGCCTC CAGCGCCTCG AAGTGTTCAA CTGGGGCACC TTCGACGGAC GAGTCTGGAC ACTCCGGCTC GACGGGCGGA ACAGTCTCCT CACAGGCGAT ATCGGTTCAG GCAAATCAAC ACTTGTCGAT GCTGTAACCA CGTTACTGGT ACCAGGCCAG CGTATCGCCT ACAACAAGGC AGCCGGAGCC GAAACGCGGG AACGCTCGCT TCGCTCCTAC GTGCTCGGTT TCTACAAATC CGAGCGGCAG GAGTCCCTCG GAGGCGGCAC CAAACCGGTT GCCCTGCGCG ACAGCAACGC CTACTCTGTG GTGCTCGGCG TCTTCCACAA TGAAGGCTAC GACAAAACCG TAACCCTCGC CCAGCTCTTC TGGATGAAGG ATCCGACAGG GCAACCTGCC CGCCTTTATG CTGCCGCAGA ACGCGACCTC TCCATCGCCA ACGACTTCTC CGGCTTCGGC ACTGAGATCA CCCCCCTGCG CAAGCACCTG CGAGCAAAAG GCGTCGAGGT GTTCGACAGC TTTCCGAAAT ACGGCGCCTG GTTCCGCCGT CGATTGAGTA TCAACAACGA ACAGGCGCTG GAGCTGTTTC ACCAGACAGT TTCTCTCAAA TCGGTGGGCA ACCTGACCGA GTTCGTGCGC AGCCACATGC TCGAACCTTT CGACGTCCAG CCGCGCATCG CAGCCCTGAT CCGCCACTTC GAAGACCTCA ACCGTGCACA TGAAGCTGTA CTCAAGGCGA AACGGCAGAT CGAAAAGCTT ACCCCGCTCG TAGAGGATTG CGACCGCCAC CGGGAGATCA ACGCCTCCAC CGAAGAGTTG CGTGGTTGCC GCGAAGCACT GCGCCCCTGG TTCGCATCTC TCAAGCTGCA ACTGCTCGAA CACCGCCTCG CAAGCCTTAA AGAGGAGATG GCCCGCCATG AGAGCGCGGT CGAACGACTT GAACAGCAAC GACGCGAAGG GCAAATGCGC GAACGAAACC TCCGTCGCAC CATTGCCGAA AACGGTGGCG ACCGGATCGA AAGCATTGCC TCCGAAATCA ACACAAAGCA GGAAGAACTC GACCGGAAAA AGCAAAAATC CTCCCGCTAC GGCGATCTGG TACTCCAACT CGGTGAGGCA CAGGCAACCA ACGTCGAAGA GTTCTTCCGC CAGCGTGCCG GACACGAAAC CATGCGTGAA GAGATTGAAG AGCGAGAGGT GAGCGTCCAG AACGACCTCA ACGAAACCGG CGTCTCTGTC GCCAGTATGC AACAGGAGTA CCGGGAACTG ACAGCCGAAA TCAAGAGTCT CAAGGCCCGT CAAAACAACA TCGATGACAG ACAGATCGCC ATGCGTCGCG AACTTTGTCA GGCGCTCGAA ATAGACGAGG AAATGATGCC CTTCGCCGGC GAACTGCTCC AGGTGCGAGA AGAGGAAACT GCTTGGGAAG GGGTCATCGA ACGGCTGCTG CGCAACTTCT CCCTCTCGCT TCTCGTGCCT GAGCATCACT ACGCACAGGT GGCTGAGTGG GTTGACCGCA CCCACCTCAA GGGACGCCTC GTCTACTTCC GGGTGCGCCC CCCGGCACGG CGCGAAACCA TGCCCGACAA TCCGGGCTCA CTGCCCCGCA AGCTCATGAT CAAGACCGAC TCTCCTCACT TCCACTGGCT CGAACGTGAG ATCGGCCACC GCTTCGACGC AGTCTGCTGC GACAGCCAGG AGGAATTCCG ACGAGAGAAA AAAGCGATCA CGATGGCCGG ACAGGTCAAA 
ATGCCCGGCG AGCGTCACGA AAAAGACGAC CGCCACCGCC TCGACGATCG TAGCCGCTAC GTACTCGGCT GGAGCAACGC AGCCAAGATA GCCGTGCTCG AAAAAAACGC CCGACAGAAA CGTGACGAAC TCGGCGAACT CAACCACCGC ATGAAAGTCA TGCAACAGGA GCAATCGACC CTGAAAGAGC GCATTACGAT CCTCTCTCGC CTCGACGAGT ACCCCGACTT CCACGATCTC GACTGGCAGC CGGTAGCCGT CGCCATCGCC CGGCTTGAAG CCGAAAAACG CGAGCTTGAG AGCACTTCCG ACAAGCTCCG CACCCTCACC ATGCAACTCG ATGAGGTAGA AAAAGGACTC GAGAAGACCG AGCGCCTCCT CGACGAACGC AAGGACAAGC GATCCAGAAC CAAGGAGAAG ATCAGCAGCG CAAGAGAGCT TGAAGAACAG GCCCTGCTGT TCGTCAACGA AGCAGCATCA GGGACAACCG ACCGCTTTTC CCGCCTCAAA GCCCTTCAGT CTGAGATGCA GGAGAGCCGC GTTCTTACCG TTGAATCGTG CGACAACCGC GAGCGCGAAA TGCGCGACTG GCTGCAGGCA AGAATCGACG CGGAAAACAA AAAGCTCTCG CGCCTGAACG AACAGATCAT CCGTGCGATG ACAGAGTACA GCGAAAAGTG GAAGCTCGAA ACCCGCGAGG TCGATATCAA TATCGCCTCT GCTGACGAGT ACCGCTCAAT GCTACAAGAG CTCCGTAAAG ACGACCTGCC GCGCTTCGAA GAAAAATTCA AGGAACTGCT CAACGAAAAC ACCATCCGCG AGGTAGCCAA CTTCCAGTCA CAGCTCACCC GTGAGCGCGA AACCATCAGG GAGCGTATCG CCCGGATCAA CGAATCGCTC ACGAAAATCG ACTACAACCC CGGCCGCTAC ATCAGCCTCG AAGCGCAGAT AAACCTCGAT GCCGACATCC GCGAGTTCCA GGCTGAGCTT CGAAGCTGCA CTGAGGGCAC ACTGACCGGT TCGGACAACG CGCAGTATTC CGAGGCAAAG TTCCTGCAAG TACGCAAGAT TATCGACCGG TTCCGGGGAC GCGAAGAGTT CGCCGACCTC GACCGCCGCT GGACGGTCAA GGTTACCGAC GTGCGAAACT GGTTCGTCTT CGCGGCCAGC GAAAGATGGC GTGAGGACGA CACCGAGCAT GAACACTACG CCGATTCGGG AGGCAAATCG GGCGGACAGA AGGAGAAGCT CGCCTACACG GTGCTCGCCG CCAGTCTCGC CTACCAGTTC GGTCTCGAGT GGGGAGAGGT TCAGTCGCGG TCATTCCGCT TCGTGGTCAT AGACGAGGCA TTCGGACGCG GATCGGACGA ATCGGCCAAT TACGGTCTGC AGCTCTTCGA GCAGCTCAAC CTGCAACTGC TCATCGTCAC TCCCCTGCAG AAGATCCACA TCATTGAACC CTTCGTCTCC AGTGTCGGAT TCGTTCACAA CGAGGACGGC CGGAACTCCG TGCTGCGCAA CCTCAGCATC GAAGAGTACC GCACCGAAAA ACGAAACATG CAGAAATGA
|
Protein sequence | MTEALEFGFI SDDRLAGFRL QRLEVFNWGT FDGRVWTLRL DGRNSLLTGD IGSGKSTLVD AVTTLLVPGQ RIAYNKAAGA ETRERSLRSY VLGFYKSERQ ESLGGGTKPV ALRDSNAYSV VLGVFHNEGY DKTVTLAQLF WMKDPTGQPA RLYAAAERDL SIANDFSGFG TEITPLRKHL RAKGVEVFDS FPKYGAWFRR RLSINNEQAL ELFHQTVSLK SVGNLTEFVR SHMLEPFDVQ PRIAALIRHF EDLNRAHEAV LKAKRQIEKL TPLVEDCDRH REINASTEEL RGCREALRPW FASLKLQLLE HRLASLKEEM ARHESAVERL EQQRREGQMR ERNLRRTIAE NGGDRIESIA SEINTKQEEL DRKKQKSSRY GDLVLQLGEA QATNVEEFFR QRAGHETMRE EIEEREVSVQ NDLNETGVSV ASMQQEYREL TAEIKSLKAR QNNIDDRQIA MRRELCQALE IDEEMMPFAG ELLQVREEET AWEGVIERLL RNFSLSLLVP EHHYAQVAEW VDRTHLKGRL VYFRVRPPAR RETMPDNPGS LPRKLMIKTD SPHFHWLERE IGHRFDAVCC DSQEEFRREK KAITMAGQVK MPGERHEKDD RHRLDDRSRY VLGWSNAAKI AVLEKNARQK RDELGELNHR MKVMQQEQST LKERITILSR LDEYPDFHDL DWQPVAVAIA RLEAEKRELE STSDKLRTLT MQLDEVEKGL EKTERLLDER KDKRSRTKEK ISSARELEEQ ALLFVNEAAS GTTDRFSRLK ALQSEMQESR VLTVESCDNR EREMRDWLQA RIDAENKKLS RLNEQIIRAM TEYSEKWKLE TREVDINIAS ADEYRSMLQE LRKDDLPRFE EKFKELLNEN TIREVANFQS QLTRERETIR ERIARINESL TKIDYNPGRY ISLEAQINLD ADIREFQAEL RSCTEGTLTG SDNAQYSEAK FLQVRKIIDR FRGREEFADL DRRWTVKVTD VRNWFVFAAS ERWREDDTEH EHYADSGGKS GGQKEKLAYT VLAASLAYQF GLEWGEVQSR SFRFVVIDEA FGRGSDESAN YGLQLFEQLN LQLLIVTPLQ KIHIIEPFVS SVGFVHNEDG RNSVLRNLSI EEYRTEKRNM QK