Gene Cphamn1_2102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2102 
Symbol 
ID6375796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2273180 
End bp2275186 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content54% 
IMG OID642684593 
Producttransketolase 
Protein accessionYP_001960492 
Protein GI189501022 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000131444 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.204144 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATACAG ACCCAATCGA TCAGCTGGCG ATCAATACCG TCCGCATGTT AGCTGTTGAC 
ATGGTGGAAA AAGCGCGGTC AGGACATCCG GGAATGCCTA TGGGAGCGGC TCCCATGGCA
TATGTGCTCT GGACAAAGAT CATGAAGCAC AATCCCGACA ATCCTGAATG GATCAACCGG
GACCGATTCG TACTGTCCGC TGGCCACGGC TCGGCGCTTC TGTATTCACT GCTGCACCTT
ACAGGTTACG ACCTCTCCAT GGATGACCTC AGACAGTTTC GCCAGTGGGG AAGCAAAACC
CCCGGGCATC CTGAATACGG CCACACTCCC GGCGTCGAGA CGACAACAGG CCCTCTTGGC
CAGGGCCTTT CCAACGCTGT CGGAATGGCA ATTGCCGAGC GATACTGCGC TACGCGTCTC
AATAAACCCG ACATGGAACT GATCGACTAC TCTACCTACG TCATATGCGG TGACGGAGAC
CTGATGGAAG GCATTACCTC CGAGGCGGCA TCCATTGCCG GACATCTCCG CCTGGGCAAG
CTGATCTGCA TGTATGATCA CAACCGCATA TCCATTGAAG GGTCGACCGA CCTTGCCTTT
ACAGAAAGCG TGCACCAGAG GTTCGAAGCA TATGGATGGC ATGTGGTGGA AATCGACGGC
AACGACCCTG AAGCTATCGA GGAAGCGTTA CATGCCGCCC GTCAGGTAAC CGGGAAACCC
TCGATGATTA TCGCCAAAAC CAACATAGGC TTCGGGAGCC CTAACAAGCA GGACAGTGCC
TCGTCGCACG GCGCCCCTCT TGGAGCTGAA GAGGTCGCGC TGGTAAGAAA ATATTTCGGA
TTCCCTGAAG AGAGCTCCTT CTTTGTCCCT GAATCGGTAG CCGCCCACAT GAGCGCCGTG
TGTGAAAAAG GGAGCCGTTC TGAAACAACA TGGAATGAAC TGTTCAACAC ATACGGTAAA
AGCCATCCCG AACTCGCTGA GGAAATGGAA ACCATGCTCC GCAACGAGCT GCCTGAAGGA
TGGGAAACAT TACTTCCTCA ATTCAGCCCT GAAGAAAAAC TTGCAACCCG TCAGGCATCT
TCCAGGGTAC TGCATGCGCT GGTAGGAAAA ATCCCGTTTT TGGTGGGTGG TTCAGCCGAC
CTCGCACCAT CAACCGGTAC AGAAGTCAAA CATGCTACCG ATTTCACCTC TGAAAATTAT
GGCGGAGCTA TTTTTCGATT CGGCGTCAGG GAACATGCCA TGGGCGCCAT TATCAACGGC
ATGGCCCTCT CCCGTATTCT CATTCCCTAC GGAGCGACCT TCCTTGTTTT CGCGGATTAT
ATGAAACCCG CTCTTCGCCT CGCAGCTATC ATGCAGGTCC CGTCTATTTT CATATTCACT
CATGACAGTA TAGCTGTCGG GGAAGACGGT CCGACACATC AGCCGATCGA ACAGCTGGCC
ATGATGCGTT CAATACCGGG CCTGACCGTT ATCCGCCCGG CAGATGCGCA GGAGACAAAA
GCGGCTTGGT ACATCGCCCT GACGCAGAAC AAACCTACGG TGCTTGTCTT TTCAAGACAG
ACACTCCCGG TACTCGACCA GGAGAAATAC CCTGTCGTGA AAGGAACCCC CAAAGGAGCC
TACATACTTT CCGAATGGAG CGCCCCGTCG ACAGATGGCA ACAGACCGGT AATACTTATC
GCGACAGGCG CTGAAGTTCA CCTTGCCCTT GAAGCACAGA GCGCTCTTCT CAACGCGGGC
GTTCCGGCCA GAGTTGTTTC CATGCCTTCT CGAGAACTGT TCGAACAGCA GCCCGAGTCT
TACCGAAACG AGGTTTTGCC GCCTTCAATA CGACGGAGAA TCGTCATTGA AGCCGCGTCT
CCTTTCGGAT GGGACAAGTA CGCAACAGAT GAAGGGAGCA TTCTGGGCAT AAACCGTTTC
GGAACGTCCG CCCCGGGAAA CACGGTATTG CGTGAATACG GTTTCAGCGC CGCCGCTATT
GTCGAAGCCG CGAAAAACCT GCAATAG
 
Protein sequence
MHTDPIDQLA INTVRMLAVD MVEKARSGHP GMPMGAAPMA YVLWTKIMKH NPDNPEWINR 
DRFVLSAGHG SALLYSLLHL TGYDLSMDDL RQFRQWGSKT PGHPEYGHTP GVETTTGPLG
QGLSNAVGMA IAERYCATRL NKPDMELIDY STYVICGDGD LMEGITSEAA SIAGHLRLGK
LICMYDHNRI SIEGSTDLAF TESVHQRFEA YGWHVVEIDG NDPEAIEEAL HAARQVTGKP
SMIIAKTNIG FGSPNKQDSA SSHGAPLGAE EVALVRKYFG FPEESSFFVP ESVAAHMSAV
CEKGSRSETT WNELFNTYGK SHPELAEEME TMLRNELPEG WETLLPQFSP EEKLATRQAS
SRVLHALVGK IPFLVGGSAD LAPSTGTEVK HATDFTSENY GGAIFRFGVR EHAMGAIING
MALSRILIPY GATFLVFADY MKPALRLAAI MQVPSIFIFT HDSIAVGEDG PTHQPIEQLA
MMRSIPGLTV IRPADAQETK AAWYIALTQN KPTVLVFSRQ TLPVLDQEKY PVVKGTPKGA
YILSEWSAPS TDGNRPVILI ATGAEVHLAL EAQSALLNAG VPARVVSMPS RELFEQQPES
YRNEVLPPSI RRRIVIEAAS PFGWDKYATD EGSILGINRF GTSAPGNTVL REYGFSAAAI
VEAAKNLQ