Gene Paes_1843 details


Gene Information

Locus tag: Paes_1843
Symbol:
ID: 6460209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 2015685
End bp: 2017826
Gene Length: 2142 bp
Protein Length: 713 aa
Translation table: 11
GC content: 54%
IMG OID: 642725827
Product: thiol-disulfide interchange protein DsbD precursor-like protein
Protein accession: YP_002016502
Protein GI: 194334642
COG category: [C] Energy production and conversion
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4232] Thiol:disulfide interchange protein
        [COG4233] Uncharacterized protein predicted to be involved in C-type cytochrome biogenesis
TIGRFAM ID:
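
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2015685, 2017826)
print(length)                  # 2142
print(protein_length(length))  # 713
```

Both values agree with the Gene Length and Protein Length fields listed in the record.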


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACTAC GCCGGATTTT TTCATTGATC ATACTGCTGA ACCTTGCTGT TCATGCCTCC 
CTGACTGCTG CAATGGCCTT TGTCTTCATG GAGGATACAG CAGAGCCAGT CAAGGCAGAA
CTCTTTGCAG CCGGAGAGCT TGATGATGGA GGTCTTCTTG TTGCCGTTCA TCTTACCATT
CAGCCAGGCT GGCATCTCTA CTGGGAGCAT CCCGGTGAGG CGGGCATGCC GGTCAGCATC
ACATGGAGGC TTCCCGAAGG CTACCGTGCT CTTCCCCTTG AATTTCCTCT TCCCGGCCGA
TTTGAGGCGG CCGGTCTTAT CGGCTATGGA TACGATGAAG AAGTTGTGCT TTTTTCAGCC
ATTGTCCCGC AGGTTCCCGG CAACAGCCAA CGGCTTCGGT CAGACCTTTT TCCGCTTCAG
GCAGAAGTGA GCTGGCTTTC ATGCAGGGAA AGCTGTATCC CCGGATCCGC ATCGCTTCGT
CTCGAAACCG GAGAGCCGGA TTCCTCGCGT TCAGGGCTGG TCGATCTGTG GCGTTCGAAA
GTTCCCGTTG CGTCCAGCTC CGCCACCGTT GCTGTCGAGC GGTTTGTTCT CGAAGAGGTG
AATGGCGATT TCAGGCTTCG TGCTGATCTT GCAGGTCCGG ATGCAGATGC GATCAACGGT
TTTTTTCCCC TCTCTCCACT TGAGGGTCTT GATCTCAAAG GGATCGAAGC TGCTCCGGGG
CGACTGTTTC TTCCTTTGAA GGGGGCTGAT TTTCCTGAAA CCGTATCCGG GGTTCTGGTT
ACTGGGAACG GTGCCTATCG TACGGAGAAG ATTCCGGTCG AGATCTCTTC CGGCAGTTCC
GCTCCGGATG ATGCGGGCTC TCTTGGAGCG ATGCTTTTCC TCGCCTTTTT TGGCGGCATG
CTGCTCAATA TTATGCCCTG TGTACTGCCT GTTCTCGGTC TCAAGGTCTT CAGCCTGATC
GGGCCGGGTG GAGATCGTGC TGCAGGACGT TTCCTGAGCC TTGTTTTTGC CGGAGGAGTG
CTTTTTTCTT TCTGGGTTCT CGCAGCATTT ATCTGGGGTT TGCAGGCAAT GGGCGCCCAG
GTTGGCTGGG GATTTCAGTT TCAGTCTCCG GCTTTCGTCA TGTTTATGGC TGCTGTAGTG
TTTGCATTCG CTCTCAACCT GTTTGGTGTT TTTGAGTTCA GCGCTCCGGT TGTTTCAGGA
CGGCTTGGAC GGCTCGCTTC CCATCATGAC AGTGCGGGTG CTTTTGTCAG CGGGGTTCTT
GCCACGACAC TGGCGACACC CTGTACAGCG CCTTTTCTTG GAACAGCCCT TGGTTTTGCT
TTTGCACAGC CACCCGGTAT CATTTTTCTC ATTTTTACGG TTATCGCTGC GGGTATGGCG
ATGCCGTATG TTGTTCTGGC ATGGCATCCT TCATGGTTGA AGTTTCTTCC GAGGCCAGGC
CAATGGATGT ATCGCTTCAA ACAGATCATG GGTTTCATTC TCGTTGCTGT TGTCGTCTGG
CTCGCCTCAA TATTGGGCAG TCAGGGGGGG AGCCCGGCTA TGCTGAACCT GTTTGTGCTC
CTTTTTGCTG TCTCATTTGT TCTCTGGCTT ACCGGTCTTT TTACTGCTCC CGGTACGTCG
ACGCTTCGTC AGGCGCTCGT CTGGCTGATC ACGATAGGCT TTCTCGCCGG GGCTTATATG
CTTCTTTCCG GAAGGATCGG AACGATATCG TCTGACTCCC GCAACAGTCT GGAACCCGGC
AGCCATACCG ATGGTAACGG CGTCGTCTGG CTTGACTACA GCCCGGATGT TCTTGATACT
CTCCTGAAGG AAGAAAAGAG CGTCCTGATC GATTTTACCG CAGAGTGGTG TTTGACCTGT
AAGGTGCTGG AAGCAAGTGT TCTCGGCAAT GAAGAGATTG GACGTGCGCT GAAGCGTGAG
GGGCTTGTCG CGGTCAGGGC TGACTGGACA AGTCGAAATG ATGAAATTAC CGCTCTTCTC
CAGCGTTTTG GCCGTTCCGG CATCCCTCTT CTTGTCATTA TTCCTCAGGG GCGGATAGAC
AATGCTGTGG TGCTGCCTGA AGTCGTTACC GTCGATATGC TTCTCGACGC GCTCTCCGGG
GTTTCCGGCG ATTCGCTTTC AAAGCTCCAG GGCAATATGT GA
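
The sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short string in the example is a toy input, not the gene itself.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy example: 4 of the 8 bases are G or C.
print(gc_percent("ATGC GCTA"))  # 50.0
```

Applied to the full gene sequence, `len(clean_sequence(...))` should equal the listed gene length of 2142 bp if the record is internally consistent.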
 
Protein sequence
MRLRRIFSLI ILLNLAVHAS LTAAMAFVFM EDTAEPVKAE LFAAGELDDG GLLVAVHLTI 
QPGWHLYWEH PGEAGMPVSI TWRLPEGYRA LPLEFPLPGR FEAAGLIGYG YDEEVVLFSA
IVPQVPGNSQ RLRSDLFPLQ AEVSWLSCRE SCIPGSASLR LETGEPDSSR SGLVDLWRSK
VPVASSSATV AVERFVLEEV NGDFRLRADL AGPDADAING FFPLSPLEGL DLKGIEAAPG
RLFLPLKGAD FPETVSGVLV TGNGAYRTEK IPVEISSGSS APDDAGSLGA MLFLAFFGGM
LLNIMPCVLP VLGLKVFSLI GPGGDRAAGR FLSLVFAGGV LFSFWVLAAF IWGLQAMGAQ
VGWGFQFQSP AFVMFMAAVV FAFALNLFGV FEFSAPVVSG RLGRLASHHD SAGAFVSGVL
ATTLATPCTA PFLGTALGFA FAQPPGIIFL IFTVIAAGMA MPYVVLAWHP SWLKFLPRPG
QWMYRFKQIM GFILVAVVVW LASILGSQGG SPAMLNLFVL LFAVSFVLWL TGLFTAPGTS
TLRQALVWLI TIGFLAGAYM LLSGRIGTIS SDSRNSLEPG SHTDGNGVVW LDYSPDVLDT
LLKEEKSVLI DFTAEWCLTC KVLEASVLGN EEIGRALKRE GLVAVRADWT SRNDEITALL
QRFGRSGIPL LVIIPQGRID NAVVLPEVVT VDMLLDALSG VSGDSLSKLQ GNM
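
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is a minimal illustration, not the database's own pipeline. It builds the codon table from the compact 64-letter encoding of the standard genetic code, whose amino-acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG). The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence listed above.
print(translate("ATGCGACTAC GCCGGATTTT TTCATTGATC"))  # MRLRRIFSLI
print(translate("GGCAATATGT GA"))                     # GNM
```

The two outputs match the first ten and last three residues of the protein sequence in the record.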