Gene Paes_1904 details

Gene Information

Locus tag: Paes_1904
Symbol:
ID: 6460080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 2081128
End bp: 2083134
Gene Length: 2007 bp
Protein Length: 668 aa
Translation table: 11
GC content: 54%
IMG OID: 642725889
Product: transketolase
Protein accession: YP_002016563
Protein GI: 194334703
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0021] Transketolase
TIGRFAM ID: [TIGR00232] transketolase, bacterial and yeast
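
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is the one that reproduces the stated 2007 bp) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: one residue per codon, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2081128, 2083134)
print(length)                  # 2007
print(protein_length(length))  # 668
```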


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACAAG ATCCTATCGA TACCCTTTCC ATCAATACCA TTCGTCTGCT TGCCGCAGAC 
ATGGTCGAAA AAGCCGGATC CGGGCACCCC GGTATGCCAA TGGGGGCCGC CCCCATGGCC
TATGTTCTCT GGACAAAAAT CATGAAACAC AATCCCCGGG ATCCGCATTG GCTGAACCGT
GACCGTTTCG TCCTGTCCGC AGGTCACGGT TCAGCCCTGC TCTACTCGCT TCTCCACCTC
TGCGGCTATG AGCTGACCAT GGAGGACCTG CAGCAGTTCC GGCAGTGGCA GAGCAGAACT
CCGGGGCACC CTGAATACCG CCATACCCCC GGTGTTGAAA TGACGACAGG ACCGCTCGGT
CAGGGCATAG CAACCGCTGT CGGCATGGCT GTCGCCGAAC GGTTTTCTGC TGAACGCCTG
AACCGTCAGA GCTTCAATAT TATTGATTAT CATACCTATG TCATCTGCGG AGACGGCGAT
CTTATGGAAG GCCTCTCTTC CGAAGCAGCC TCCATTGCCG GCCATCTCAA GCTCGAAAAA
CTGATCTGCC TCTATGACGA CAACAGGATC TCCATCGAAG GCCCGACCAG CCTCGCATTC
ACTGAAAATG TCGGCAAACG GTTCGAGGCT TTCGGCTGGA ATGTTCTGGA CGTTGACGGT
AACGATCCCG AGGCGGTCGA AGAGGCGTTG CGGAAAGTCA GAAAAGGAAC AGGCAAACCA
AACCTGATCA GGGCAACCAC CAACATCGGC TATGGAAGCC CGAACAAACA GGACAATGCT
GCATCCCATG GATCGCCGCT GGGAAAAGAG GAACTTGCCA TGGTCAGGAA AAAATTCGGT
TTTCCCGAAG ATGAATCATT CGTCATTCCT GAAGCGGTAA CCGCTCATAT GCGAGCTATC
AGCGATAAAG GAGAACAGAG CCAGGCTGAG TGGACAACAC TGTGGGAAGC ATATAGCCAA
GCCCATCCTG ATCTGGCAAC CCAGATCAAC GACCTTATCA ATAAACGTCT GCCGGACTCC
TGGCAGACGC TTCTTCCCGA ATTCAACCCG GAAGAACAGC TCGCCACGCG CCAGGCACTC
AATAAAATCC TGCATGCTCT GATTGGCAAA ATCCCTTTCC TTGCAGGCGG TTCGGCCGAT
CTGGCCCCAT CGAACGGGAC AGCAGTCAAA AACGCTGAGG ATTTCAATCC CGACAATTAC
GGCGGCACAA ATTTCCACTT TGGCGTCAGA GAACATGCCA TGGGCGCGAT CATCAACGGC
ATGGCGCTTT CAGGAATGCT CAAGCCCTAC GGAGCGACAT TCCTGGTCTT TGCAGACTAT
ATGAAACCTG CCCTGAGACT TGCGGCACTG ATGCAGATCC CTTCAACATT TATTTTCACC
CATGACAGCA TTGCTGTTGG TGAAGACGGG CCGACACACC AGCCGATCGA ACAGCTGGTC
ATGCTGCGAT CAATTCCTGG TATGACAGTC ATCAGACCTG CTGATGCCAA TGAAACGAAA
GCTGCATGGA AATATATCAT GACAGCACAG ACCCCGGTAT CACTGATACT CTCTCGCCAG
GCCCTTCCCA TACTCGACAG CAGCCTCTAC CCCTCTGAAG AAGGAACAGC CAGAGGAGGC
TATATTCTGG CTGATTGGGG AAACGCTTCA GGCAGCGGAT CCAAGCCGGT TATCCTCATC
GCAACAGGGG CAGAAGTCCA CCTGGCAGTG CAGGCACAGC AACAACTTGA AGAAGAAGGC
ATTCCTGCCC GCGTGATATC GATGCCATCA GTCGAACTCT TCCTGCAGCA GCCCGAAGAA
TATCGAGAAG AAGTCCTGCC GACCTCAATA CGCAGAAGAG TCGTTATCGA AGCAGCCTCA
ACTATCGGCT GGCACCGCTT TATCACTGAG GAAGGAACGA TTGTGGGTAT CGACAGGTTC
GGCTCTTCGG CTCCGGGAAG CAGAGTGCTC AAGGAGTACG GATTCACGGC AGAGCATATT
GTAGCGACAG TAAAAACCCT TCTCTGA
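
The GC content field in the Gene Information table can in principle be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch: it assumes GC content means the fraction of G and C bases over all bases, and that whitespace is the only formatting to strip. The helper name gc_content is hypothetical, and the example runs on a short toy string rather than the full gene.

```python
def gc_content(sequence_text: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring spaces and line breaks."""
    seq = "".join(sequence_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy input in the same space-separated layout as the gene sequence.
example = "ATGC GGCC"
print(f"{gc_content(example):.0%}")  # 75%
```

Passing the full gene sequence text to the same function gives the value to compare against the GC content field.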
 
Protein sequence
MQQDPIDTLS INTIRLLAAD MVEKAGSGHP GMPMGAAPMA YVLWTKIMKH NPRDPHWLNR 
DRFVLSAGHG SALLYSLLHL CGYELTMEDL QQFRQWQSRT PGHPEYRHTP GVEMTTGPLG
QGIATAVGMA VAERFSAERL NRQSFNIIDY HTYVICGDGD LMEGLSSEAA SIAGHLKLEK
LICLYDDNRI SIEGPTSLAF TENVGKRFEA FGWNVLDVDG NDPEAVEEAL RKVRKGTGKP
NLIRATTNIG YGSPNKQDNA ASHGSPLGKE ELAMVRKKFG FPEDESFVIP EAVTAHMRAI
SDKGEQSQAE WTTLWEAYSQ AHPDLATQIN DLINKRLPDS WQTLLPEFNP EEQLATRQAL
NKILHALIGK IPFLAGGSAD LAPSNGTAVK NAEDFNPDNY GGTNFHFGVR EHAMGAIING
MALSGMLKPY GATFLVFADY MKPALRLAAL MQIPSTFIFT HDSIAVGEDG PTHQPIEQLV
MLRSIPGMTV IRPADANETK AAWKYIMTAQ TPVSLILSRQ ALPILDSSLY PSEEGTARGG
YILADWGNAS GSGSKPVILI ATGAEVHLAV QAQQQLEEEG IPARVISMPS VELFLQQPEE
YREEVLPTSI RRRVVIEAAS TIGWHRFITE EGTIVGIDRF GSSAPGSRVL KEYGFTAEHI
VATVKTLL
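
The protein sequence above is the translation of the gene sequence under translation table 11. The sketch below builds a codon table by hand instead of using a library. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and this gene starts with ATG, so no special start handling is needed here. The names BASES, AMINO_ACIDS, CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence_text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(sequence_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last nine codons of the gene sequence.
print(translate("ATGCAACAAG ATCCTATCGA T"))        # MQQDPID
print(translate("GTAGCGACAG TAAAAACCCT TCTCTGA"))  # VATVKTLL
```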