Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2102 |
Symbol | |
ID | 6375796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2273180 |
End bp | 2275186 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642684593 |
Product | transketolase |
Protein accession | YP_001960492 |
Protein GI | 189501022 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000131444 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.204144 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATACAG ACCCAATCGA TCAGCTGGCG ATCAATACCG TCCGCATGTT AGCTGTTGAC ATGGTGGAAA AAGCGCGGTC AGGACATCCG GGAATGCCTA TGGGAGCGGC TCCCATGGCA TATGTGCTCT GGACAAAGAT CATGAAGCAC AATCCCGACA ATCCTGAATG GATCAACCGG GACCGATTCG TACTGTCCGC TGGCCACGGC TCGGCGCTTC TGTATTCACT GCTGCACCTT ACAGGTTACG ACCTCTCCAT GGATGACCTC AGACAGTTTC GCCAGTGGGG AAGCAAAACC CCCGGGCATC CTGAATACGG CCACACTCCC GGCGTCGAGA CGACAACAGG CCCTCTTGGC CAGGGCCTTT CCAACGCTGT CGGAATGGCA ATTGCCGAGC GATACTGCGC TACGCGTCTC AATAAACCCG ACATGGAACT GATCGACTAC TCTACCTACG TCATATGCGG TGACGGAGAC CTGATGGAAG GCATTACCTC CGAGGCGGCA TCCATTGCCG GACATCTCCG CCTGGGCAAG CTGATCTGCA TGTATGATCA CAACCGCATA TCCATTGAAG GGTCGACCGA CCTTGCCTTT ACAGAAAGCG TGCACCAGAG GTTCGAAGCA TATGGATGGC ATGTGGTGGA AATCGACGGC AACGACCCTG AAGCTATCGA GGAAGCGTTA CATGCCGCCC GTCAGGTAAC CGGGAAACCC TCGATGATTA TCGCCAAAAC CAACATAGGC TTCGGGAGCC CTAACAAGCA GGACAGTGCC TCGTCGCACG GCGCCCCTCT TGGAGCTGAA GAGGTCGCGC TGGTAAGAAA ATATTTCGGA TTCCCTGAAG AGAGCTCCTT CTTTGTCCCT GAATCGGTAG CCGCCCACAT GAGCGCCGTG TGTGAAAAAG GGAGCCGTTC TGAAACAACA TGGAATGAAC TGTTCAACAC ATACGGTAAA AGCCATCCCG AACTCGCTGA GGAAATGGAA ACCATGCTCC GCAACGAGCT GCCTGAAGGA TGGGAAACAT TACTTCCTCA ATTCAGCCCT GAAGAAAAAC TTGCAACCCG TCAGGCATCT TCCAGGGTAC TGCATGCGCT GGTAGGAAAA ATCCCGTTTT TGGTGGGTGG TTCAGCCGAC CTCGCACCAT CAACCGGTAC AGAAGTCAAA CATGCTACCG ATTTCACCTC TGAAAATTAT GGCGGAGCTA TTTTTCGATT CGGCGTCAGG GAACATGCCA TGGGCGCCAT TATCAACGGC ATGGCCCTCT CCCGTATTCT CATTCCCTAC GGAGCGACCT TCCTTGTTTT CGCGGATTAT ATGAAACCCG CTCTTCGCCT CGCAGCTATC ATGCAGGTCC CGTCTATTTT CATATTCACT CATGACAGTA TAGCTGTCGG GGAAGACGGT CCGACACATC AGCCGATCGA ACAGCTGGCC ATGATGCGTT CAATACCGGG CCTGACCGTT ATCCGCCCGG CAGATGCGCA GGAGACAAAA GCGGCTTGGT ACATCGCCCT GACGCAGAAC AAACCTACGG TGCTTGTCTT TTCAAGACAG ACACTCCCGG TACTCGACCA GGAGAAATAC CCTGTCGTGA AAGGAACCCC CAAAGGAGCC TACATACTTT CCGAATGGAG CGCCCCGTCG ACAGATGGCA ACAGACCGGT AATACTTATC GCGACAGGCG CTGAAGTTCA CCTTGCCCTT GAAGCACAGA GCGCTCTTCT CAACGCGGGC GTTCCGGCCA GAGTTGTTTC CATGCCTTCT CGAGAACTGT TCGAACAGCA GCCCGAGTCT TACCGAAACG AGGTTTTGCC GCCTTCAATA CGACGGAGAA TCGTCATTGA AGCCGCGTCT CCTTTCGGAT GGGACAAGTA CGCAACAGAT GAAGGGAGCA TTCTGGGCAT AAACCGTTTC GGAACGTCCG CCCCGGGAAA CACGGTATTG CGTGAATACG GTTTCAGCGC CGCCGCTATT GTCGAAGCCG CGAAAAACCT GCAATAG
|
Protein sequence | MHTDPIDQLA INTVRMLAVD MVEKARSGHP GMPMGAAPMA YVLWTKIMKH NPDNPEWINR DRFVLSAGHG SALLYSLLHL TGYDLSMDDL RQFRQWGSKT PGHPEYGHTP GVETTTGPLG QGLSNAVGMA IAERYCATRL NKPDMELIDY STYVICGDGD LMEGITSEAA SIAGHLRLGK LICMYDHNRI SIEGSTDLAF TESVHQRFEA YGWHVVEIDG NDPEAIEEAL HAARQVTGKP SMIIAKTNIG FGSPNKQDSA SSHGAPLGAE EVALVRKYFG FPEESSFFVP ESVAAHMSAV CEKGSRSETT WNELFNTYGK SHPELAEEME TMLRNELPEG WETLLPQFSP EEKLATRQAS SRVLHALVGK IPFLVGGSAD LAPSTGTEVK HATDFTSENY GGAIFRFGVR EHAMGAIING MALSRILIPY GATFLVFADY MKPALRLAAI MQVPSIFIFT HDSIAVGEDG PTHQPIEQLA MMRSIPGLTV IRPADAQETK AAWYIALTQN KPTVLVFSRQ TLPVLDQEKY PVVKGTPKGA YILSEWSAPS TDGNRPVILI ATGAEVHLAL EAQSALLNAG VPARVVSMPS RELFEQQPES YRNEVLPPSI RRRIVIEAAS PFGWDKYATD EGSILGINRF GTSAPGNTVL REYGFSAAAI VEAAKNLQ
|
| |