Gene Paes_1395 details


Gene Information

Locus tag: Paes_1395
Symbol:
ID: 6458765
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1513955
End bp: 1516117
Gene Length: 2163 bp
Protein Length: 720 aa
Translation table: 11
GC content: 52%
IMG OID: 642725380
Product: protein of unknown function DUF255
Protein accession: YP_002016063
Protein GI: 194334203
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1331] Highly conserved protein containing a thioredoxin domain
TIGRFAM ID:
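
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1513955
END_BP = 1516117


def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the listed Gene Length (2163 bp) and Protein Length (720 aa).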


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0132403
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.87283
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATGA AAGAGAAAAA CAAAGTCCCT AACGCTCTCA GCAAAGAAAA GAGTCCTTAT 
CTGCTCCAGC ACGCATACAA CCCTGTGCAA TGGCTTGCCT GGGGGCCAGA CGCCTTCAAT
ACATCGCTAC GGGAGGATAA ACCGATTTTT CTCTCCGTCG GATATTCCAC CTGCCACTGG
TGCCATGTTA TGGAGCGTGA ATCGTTTGAA AATGACGAGA TCGCACAAGT GCTCAACCAT
AGCTTTGTTC CGGTAAAAAT CGATCGAGAA GAGCGCCCCG ACATCGACCG GCTCTATATG
GCCTATGTGC AGGCCTCGAC AGGCTCGGGC GGCTGGCCGA TGTCTGTCTG GCTCACACCT
GAGCTCAAAC CCTTTTACGG CGGGACCTAC TATCCTCCTG AAGACCGCTT CGGACGTCCG
GGATTTCTTT CGCTGCTCCA CTCCATTGCC GACGCATGGA AGGAAGACAG AAAAAAGCTG
GAACACGTAG CAGACGGGAT ACAGAGCCAG CTCAAATCCT TTTCCACTGC CGCACCTCAC
CCTGAAAGCC TTGGCGAGAA AGTGCTGGAC GATGCATTCA TGCAGATATC AAGCCACTTC
GATCCCGTAG CCGGAGGATT CAGCAGCGCT CCGAAATTTC CAAGACCTTC GATCCTGACA
TTTCTGTTCA ATTATGCTTA CTTCACCGGT CGTGAGGAAG CCTCGGCCAT GGCACTCCTG
ACCCTTGAAC GCATGGCCAG GGGCGGCATC CATGACCATC TCGGTGTCAA AGGAAAAGGA
GGCGGCGGTT TTGCCCGTTA CGCCACCGAT GCGTTATGGC ACGTCCCCCA CTTTGAAAAA
ATGCTCTACG ACAATGCCCT GCTCGCGCTC TCCTTCCTCG AAGCGTTTCA ACTGACAAAA
GAAACACTCT ATGCTCAAAC GGCAGAGGAT ATTTTCAATT ACGTCCTGTG CGACATGACC
TCTCCTGAAG GTGCTTTTTA TTCGGCTGAA GATGCCGACA GCTTCCCTGA TAGAGAGAGC
AAAACGAAGA TCGAAGGAGG ATTTTATGTC TGGACAAAAA CAGAGATCGC AGAACTCCTT
GACCCTCTCG AGGAGCAGAT TTTCTCGTTT CGCTACGGCG TCAAACAAAA CGGCAATGTC
CTTGAAGACC CGCACGGAAC GTTTGAGAGG AAAAATATTC TCTCGCTGAA GGCTGATGAA
GAAACCACCG CCAAACACTT CGATCTCCCG ACAGATCAGG TGGCAAACCT GAGCAGGTCT
GCAATCGAAA AACTGTTTCA AGCCCGCATG AGACGTCCTC GACCCGACAG GGACGATAAA
ATCATCACCT CATGGAACGC TCTGATGATC TCTGCACTGG CAAAAGGCAG CCGCGTCTTG
CAGAATACAG ACTATCTCAC CGCAGCCGAA AAAGCGGCCG GATTTATCGG CGACAATCTC
TTTGAAAACG GCACGGGAAA TCTCTTACGG CGCTACTGCA AAGGAGAATC GGGAATTACG
GGCCAGGCAG AAGACTACGC CTTTCTCATC CAGGGCCTGC TGGATCTCTA TGAAGCCTCT
TTCGACGACT CCTTGCTCCA CAAGGCGCAG GAACTTGCGG AACGTCAGTG TGAGCACTTC
TATGATGACG AACATGGAGG TTTTTTCAAT GCCTCGTCGC AGGAGGCCTC TGTGCCGATA
CGCCTCAAAG AGGACTATGA TGGCGCCGAA CCATCGGCAA ATTCTGTCAG TGTCATGAAC
TTCAGCAGAT TATGGCTGAT GACAGGCAAA CAGCACTATC TCGATATCGC CGAAAAGACC
CTGTACTACT TCAGCGCTAT ACTGGCCGCA AACGGCATGC AGCTCCCGGA AATGCTCGCG
GGATACGCTC GGCTGCTCCA TCCGTCTAAC ACCGTCATCC TGACCGGCTC ACAATCGGAC
CCTGCATTCA AAGCATTGAA AAAAAGCGTA GAGCAGCTCT ATCTTCCCGG CACAACAGTG
ATGCATGCCA CAAAAGAAAA ACCGGTCAGT TCAATACCCG GAGCTGAAAC AGCAAGCGAG
GAGAACAATT CCGCAGCCGC ATATATCTGT AAAGGAGGGA GTTGCCGGTT ACCTGTCACT
ACTCCCGAGG AGGTGACGAA CCTACTCCGG CCATCGGGCC GAAGTGCAGG CAAAAAAAGC
TGA
 
Protein sequence
MTMKEKNKVP NALSKEKSPY LLQHAYNPVQ WLAWGPDAFN TSLREDKPIF LSVGYSTCHW 
CHVMERESFE NDEIAQVLNH SFVPVKIDRE ERPDIDRLYM AYVQASTGSG GWPMSVWLTP
ELKPFYGGTY YPPEDRFGRP GFLSLLHSIA DAWKEDRKKL EHVADGIQSQ LKSFSTAAPH
PESLGEKVLD DAFMQISSHF DPVAGGFSSA PKFPRPSILT FLFNYAYFTG REEASAMALL
TLERMARGGI HDHLGVKGKG GGGFARYATD ALWHVPHFEK MLYDNALLAL SFLEAFQLTK
ETLYAQTAED IFNYVLCDMT SPEGAFYSAE DADSFPDRES KTKIEGGFYV WTKTEIAELL
DPLEEQIFSF RYGVKQNGNV LEDPHGTFER KNILSLKADE ETTAKHFDLP TDQVANLSRS
AIEKLFQARM RRPRPDRDDK IITSWNALMI SALAKGSRVL QNTDYLTAAE KAAGFIGDNL
FENGTGNLLR RYCKGESGIT GQAEDYAFLI QGLLDLYEAS FDDSLLHKAQ ELAERQCEHF
YDDEHGGFFN ASSQEASVPI RLKEDYDGAE PSANSVSVMN FSRLWLMTGK QHYLDIAEKT
LYYFSAILAA NGMQLPEMLA GYARLLHPSN TVILTGSQSD PAFKALKKSV EQLYLPGTTV
MHATKEKPVS SIPGAETASE ENNSAAAYIC KGGSCRLPVT TPEEVTNLLR PSGRSAGKKS
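
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequences are pasted as plain strings with the spacing shown above, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs from the standard table in its permitted start codons, not in these assignments). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard codon assignments, enumerated in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base in frame 0, stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGACAATGA AAGAGAAAAA CAAAGTCCCT"
print(translate(first_blocks))
```

Translating these first 30 bases gives MTMKEKNKVP, the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` is one way to compare against the GC content listed in the Gene Information section.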