Gene Paes_0211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_0211 
Symbol 
ID6458396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp219336 
End bp222653 
Gene Length3318 bp 
Protein Length1105 aa 
Translation table11 
GC content53% 
IMG OID642724202 
Producttrehalose synthase 
Protein accessionYP_002014915 
Protein GI194333055 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAAA CACGCACAAC CGGCCAGCCT GAGCCGCAAT GGTACAAGGA TGCCATCATT 
TATGAAGCGC ATGTCAAAAC ATTTTTTGAC AGCAACAATG ACGGTGTCGG TGATTTTGAG
GGGCTTCGTC AGAAGCTGCC CTATCTTGAA AGTCTTGGTA TTACGGCGAT CTGGCTGTTG
CCGTTTTATC CTTCTCCGCT GAGAGATGAT GGGTATGATA TTGCCGATTA TATGGATATC
AATCCCGATT ACGGTACGAT CGACGACTTT AGAGCATTTC TCAATGAAGC GCATGAGCGC
GGGCTGAAAG TTATTACCGA GCTGGTTATC AACCATACCT CCGATCAGCA TGCATGGTTT
CAGAGGGCCC GTCGTGCAGA ACAGGGCTCC GACGAGCGCA ATTTCTATGT GTGGACTGAC
GATCCGAAAA AGTATTCGGA AACACGGATT ATTTTCCAGG ACTTCGAGTC GTCAAACTGG
ACTTGGGACC CGATCGCCGG ACAGTATTTC TGGCATCGTT TCTACCACCA TCAGCCCGAT
CTTAATTTTG AGAACCCCTC GGTTGAAAAG GCGCTCTACA AGGTGCTTGA CTACTGGCTG
GAGATGGGTG TGGACGGGTT GCGCCTTGAC GCTGTACCGT ATTTGTACGA GGAAGAGGGC
TCCAACTGCG AGAACCTGCC GCGTACGCAT GAGTTTTTGA AGCGTCTTCG CAAGCATGTC
GACGACAAGT TCCCTAACCG GATGCTGCTT GCCGAGGCAA ACCAGTGGCC TGAAGATGCT
GCCGAGTATT TCGGGGATGG CGACGAGTGC CATATGAATT TTCATTTTCC GCTCATGCCG
AGGATGTATA TGGCCCTGGA AATGGAGGAC CGTTTTCCGA TCATCGATAT TCTCGACCAG
ACGCCCGAGA TTGCCGAAAC CTGCCAGTGG GCTTCGTTCC TGCGTAACCA CGATGAGCTT
ACCCTCGAAA TGGTGACCGA TGAGGAGCGC GACTACATGA GGCGGGTCTA CGCTCATGAT
CCGAAGGCAA GGATCAATCT CGGTATCCGC CGCAGACTTG CGCCCCTGAT GTCGAACGAC
CGGCGCAAGA TCGAACTGAT GAACATCATG CTGCTTTCCT TGCCGGGAAC ACCGGTACTC
TATTACGGTG ACGAGATCGG CATGGGGGAT AACTTCTACC TTGGAGACCG TGATGGTGTG
CGGACCCCCA TGCAGTGGAA CGGGGACCGC AATGCCGGTT TTTCGCGGGC CAATCCGCAA
CAGCTGCAGC TGCCCGTTAT TATCGACCCC GAGTATCACT ACGAGGCAAC CAATGTCGAG
GTGCAGGACA GCAACATCAA CTCTCTTCTG TGGTGGACTC GTCACATGCT CTCAACCTCG
CGCCGTTACA AGGCGCTGAG CCGTGGAGAT ATACGCTTTA TCGCATGTCA GAATCCCCAG
ATCCTGATTT TCAGCAGGAC TTTTGAGGAT GAAACGATGC TTTGTATCAT CAACCTTTCC
CGTAACGCCC AGGCGGCCAC GGTCGATCTG TCGGAGTATG AGGGCCATAC TCCCGAGGAG
GTGTTCAGTC TCAGCCACTT CCCAGGCATT ACGTCGAGGC CCTATACGGT AACGCTTGGC
CCGTACGGTT ATTTCTGGTT CAAGCTCATC AGGACCGAAG AGGAGATCGG CTCACGCCGC
TATGTTGACA AGCCCTTTGC TCAGGTTGCT TCTCTGAAGG ACCTTTTCGC CGGTAAAGCC
CTCGAACGTC TTGAAACGAA AGTTCTTCCT GAATATATCA GGGGGTGCAG GTGGTTCGGT
GGCAAAGCGC GCAAGATTGT CAGAGTGAGT GTCGAAGAGC ATATTCCTGT GACGGCATGC
GACAACACGG TCTACATGAT TGTCGAAGTA CGTTATCCCA GCGGGTCGAA CGATACCTAT
CAGCTTCCGG TGACGTTTCT CCCGGCTGGA GAGTTCAATC CTGATGATGA CTACTTTCTC
AAGCAGGTCG TCTGCAGCGT AAAAATAGGT GATGAGGAGG GTTATCTCTG CGATTCGGCT
TATCAGAAAT CGTTCCACAG CTTCCTGCTC GATACCATTG TCAGCAGTAA GGGGCTGAAA
GGATCGACCG GTAAACTGAC GGGTGAAAAG GGTTCCCGTG TTGAAGAGTT TCTTGAAACC
CAAGACGATG GAGAGATGAA TTCCGTGCTT TTCGGGGCCG AGCAGAGCAA TACGTCGATC
ATGTATGGCG ACAGGCTTTG CCTGAAGATC TATCGTAAGA TCTCTTCAGG GGTTTCTCCT
GAGGTGGAGA TCTGCCGTAT GCTGACAGAA AAGACATCGT TCGAAAGTTC GCCCGCGTAT
CTTGGTGCCC TTCATTTTAC CCGGAGCCGT AAAGACCAGT GCTCCCTCGG GATTCTGCAG
AATTTCATTC CGAACGAGGG TGATGCATGG AGCCAGACGC TGCATTTCGT GCATCGTTAT
TACGAAGATG TGCTCGTCAT GCTGCCGCAG ATTTCAGAGG TACCCGTCCT CCCATCTACC
GGAGGAGAGT CTGTCGAAAT GCCGGAAATT ATTCATGGCC TTATCGGAGA ACCTTACCTC
GAGATGGTGT CGAGACTGGC TGAAAGGACT GCCGGAATGC ACCTTGCACT GGCCTCACCG
GACCTTGGTC AGGACTTTAT CCCGGAACCC TTCACCACGC TTTACCAGCG TTCGATCTAT
CAGTCCATGA GGGAGCAGGT GAAGCGCGGC ATGGTGCTTC TGCGCGAGCA GATGGGAGGT
GTCGCTGAAG AGTACCAGGG GTTGGCTGCC GGGCTTTTGA AGCGCGAAGG GGAGATCCTC
GAACAACTGT CGCATATCAA AGCCCGCAAG ATCGCCGCGT CCAAGATCCG GATCCATGGC
GATTATCATC TCGGTCAGGT ACTGTGGACC GGTAAGGACT TTGTGATTAT CGATTTTGAG
GGCGAGCCTG CCCGATCGTT GAGCGAACGG CGTATCAAGC GTTCAGCGTT CCGTGATCTT
GCAGGTATGA TGCGTTCCTT CCATTATGCG GCATTCAACG TGCTGATTCA GGATCGTTCG
ATCAGGCAGG AAGATGTTGA ACGACTTGAG CCGTGGGCTG AACTGTGGAG CTTCTATACC
GGGCAGCACT TCTTCGATGT CTACGAAGAT GCCGTCAGGG GACAGGGGCT CATTCCTGAA
GATGTTAAGG AGCAGCATCT GCTTCTTCGC GCTTATCTTA TGGATAAGGC GATTTATGAG
TTGAACTATG AGCTGAACAA CCGTCCTGAA TGGGTCGGGA TTGCGCTCAA GGGTCTCAGC
AGGCTGCTCG AGTTGTAG
 
Protein sequence
MPKTRTTGQP EPQWYKDAII YEAHVKTFFD SNNDGVGDFE GLRQKLPYLE SLGITAIWLL 
PFYPSPLRDD GYDIADYMDI NPDYGTIDDF RAFLNEAHER GLKVITELVI NHTSDQHAWF
QRARRAEQGS DERNFYVWTD DPKKYSETRI IFQDFESSNW TWDPIAGQYF WHRFYHHQPD
LNFENPSVEK ALYKVLDYWL EMGVDGLRLD AVPYLYEEEG SNCENLPRTH EFLKRLRKHV
DDKFPNRMLL AEANQWPEDA AEYFGDGDEC HMNFHFPLMP RMYMALEMED RFPIIDILDQ
TPEIAETCQW ASFLRNHDEL TLEMVTDEER DYMRRVYAHD PKARINLGIR RRLAPLMSND
RRKIELMNIM LLSLPGTPVL YYGDEIGMGD NFYLGDRDGV RTPMQWNGDR NAGFSRANPQ
QLQLPVIIDP EYHYEATNVE VQDSNINSLL WWTRHMLSTS RRYKALSRGD IRFIACQNPQ
ILIFSRTFED ETMLCIINLS RNAQAATVDL SEYEGHTPEE VFSLSHFPGI TSRPYTVTLG
PYGYFWFKLI RTEEEIGSRR YVDKPFAQVA SLKDLFAGKA LERLETKVLP EYIRGCRWFG
GKARKIVRVS VEEHIPVTAC DNTVYMIVEV RYPSGSNDTY QLPVTFLPAG EFNPDDDYFL
KQVVCSVKIG DEEGYLCDSA YQKSFHSFLL DTIVSSKGLK GSTGKLTGEK GSRVEEFLET
QDDGEMNSVL FGAEQSNTSI MYGDRLCLKI YRKISSGVSP EVEICRMLTE KTSFESSPAY
LGALHFTRSR KDQCSLGILQ NFIPNEGDAW SQTLHFVHRY YEDVLVMLPQ ISEVPVLPST
GGESVEMPEI IHGLIGEPYL EMVSRLAERT AGMHLALASP DLGQDFIPEP FTTLYQRSIY
QSMREQVKRG MVLLREQMGG VAEEYQGLAA GLLKREGEIL EQLSHIKARK IAASKIRIHG
DYHLGQVLWT GKDFVIIDFE GEPARSLSER RIKRSAFRDL AGMMRSFHYA AFNVLIQDRS
IRQEDVERLE PWAELWSFYT GQHFFDVYED AVRGQGLIPE DVKEQHLLLR AYLMDKAIYE
LNYELNNRPE WVGIALKGLS RLLEL