Gene Paes_1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1803 
Symbol 
ID6458796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1968951 
End bp1971068 
Gene Length2118 bp 
Protein Length705 aa 
Translation table11 
GC content54% 
IMG OID642725787 
Productshort chain dehydrogenase 
Protein accessionYP_002016462 
Protein GI194334602 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only
[S] Function unknown 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3347] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAACC TCTGGGATAC TTCAGGCTGC GTCTCATGTA TCGAAGCGCT TTGTTCGGGG 
ACCGAAATAC CTCGTGAGCT CTGTGAACTT GTCTATGCGT CACGTCTTCT CGGTCGCGAA
AACAGGCTGG TCATGCATGG CGGAGGTAAT ACATCGGTCA AAAGCGAGCT TACCGATGTT
CTGGGCAACA AGGCCAATGT GCTGTTTATC AAAGGCAGCG GAGTTGATCT CAAAGAGATT
GTTCCGCACG ATTTTACTCC TGTCCGGCTT GAGCCGATGC AGAGATTGCT CAAGCTGATC
CACGAGGGGA GGGGCCATAG CGAAGAGGAC CTTCGTCGTT TTTCCAACAA GGAGTTCAAG
AACTTTCTTT TTTTGAACCT GTTCAGCCTG AACGACAATA TGGTGCAGAA GGGTTTGACG
CCATCGATCG AGACCCTCTT GCATGCATTC CTTCCCCATC GCTATATTTT CCATACTCAT
GCACAGGCAT TGCTCACCTT GAGCAATCAG CCGGATGGTG AAGCGCTCTG CAGGGAGGTG
CTTGGTGATC GTATGGGTAT TGTGCCCTAT ATCCGGCCCG GTATCGAGCT TGCATGCTCT
GCCGAGCGGG TCTATGCCAA CCACGCTGAC ATAGAAGGCC TGGTGCTGCT CAAACACGGT
CTGGTGACGT TCGGCGAGAG TGCAAAAGAT GCCTACGATC GAATGATCTC TGCTATCAGC
ACAATGGAGG ATCGCATCGA CAGGGCCGGT CGGAAGGTGT TTGCGCAGAC GGAACTTCCC
GGATCGCTGA TGGCGGTCGA GCATGTCGCT CCGGTTATTC GCGGTGCATG CGTTCAGGAA
CGCACGCCAC GTAGCAGAGA CTACGATACG TTTATTCTCG ACTTCCGTAC GTCGCCCGAT
ATTATCGAAT ATGTCAACAG AGCCGATCTG GCACAACTCA GCGCCAAAGG CGCTATGACC
CCTGACTTCA TCATCAGGAC CAAGAACCGG CCTCTCGTGG TTCCGGCACC TGATGCTGCC
GATCCCGATG CATTCCGTCG CGATGTGCAT GAGGCTGTTG ACGCCTATAA AAAAGAGTAT
ACAGCCTATT TCGAGCGCCA GAAGGCGGCG AGCGGCATGG ACGTCGAGAT GCTCGACCCG
CTGCCCAGAG TTGCGCTGGT TCCCGGTCTC GGGCTTTTCG GTCTGGGCAG GACTCTTCGC
GACGCCCGCG TGAATGCCGA TATCGCAGAA AGTTCTGCCG AAGCGGTGCT GAAGGCCGAA
AGCGTTGGTT CCTTCGAGTC CATCAGTGAA AAAGAGGTGT TCGAGATTGA GTATTGGGAG
ATGGAGCAGG CCAAAGTCAA AAAAGTCCGT CACGATGTCT TTGCAGGCCG TGTTGCAATG
GTTACCGGCG CTGCAGGCGG GATCGGTCTT GCGACAGCCA AAGCATTCAG GGAAAAAGGC
GCTGAAATTG TCATCCTGGA CCTCAATCAG GAATCCCTTG ATAAAGCGTG TGAAGAGATC
GGCCCCGACA CGCTGGCCAT TGCATGTGAC GTCACCGATC GAAGCGCGAT TGCCGAAGCC
TTCAACCGCA CGTGTCGGAT GTTCGGCGGG ATTGATATTG TTGTTTCCAA TGTAGGCATC
GCCATTCAGG GCAGGATCGG CGATGTCGAC GAAGCGCTTC TTCGCCGGAG TTTCGAGTTG
AATTTCTTTT CTCATCAGAC TATTTCACAG CACGCTGTTG CTGTCATGCG ACGGCAGGGT
ATCGGCGGTA ACCTGCTCTA CAACGTTTCA AAACAGGCTG TCAATCCCGG ACCGGATTTT
GGAGCATACG GACTGCCAAA GGCTGCGACC CTCTTTCTCG TTCGCCAGTA TGCGCTTGAT
CACGGCAGGG ACGGTATCAG GGCAAACGGC ATCAATGCCG ATCGTATCCG TACCGGTCTT
CTCAATGAAG AGATGATCGC AAAGCGCTCG AAAGCCAGGG GACTGAGCCC TGAAGAGTAT
ATGGCAGGCA ACCTTCTCAA GCTCGAGGTG ACGGCTGAAG ATGTCGCCCA GGCGTTCGTC
CACCTTGCGC TCGAGCAGAA GACAACCGGT TCGATCACCA CCGTTGATGG CGGCAACATT
GCAGCAGCCC TCAGATAA
 
Protein sequence
MQNLWDTSGC VSCIEALCSG TEIPRELCEL VYASRLLGRE NRLVMHGGGN TSVKSELTDV 
LGNKANVLFI KGSGVDLKEI VPHDFTPVRL EPMQRLLKLI HEGRGHSEED LRRFSNKEFK
NFLFLNLFSL NDNMVQKGLT PSIETLLHAF LPHRYIFHTH AQALLTLSNQ PDGEALCREV
LGDRMGIVPY IRPGIELACS AERVYANHAD IEGLVLLKHG LVTFGESAKD AYDRMISAIS
TMEDRIDRAG RKVFAQTELP GSLMAVEHVA PVIRGACVQE RTPRSRDYDT FILDFRTSPD
IIEYVNRADL AQLSAKGAMT PDFIIRTKNR PLVVPAPDAA DPDAFRRDVH EAVDAYKKEY
TAYFERQKAA SGMDVEMLDP LPRVALVPGL GLFGLGRTLR DARVNADIAE SSAEAVLKAE
SVGSFESISE KEVFEIEYWE MEQAKVKKVR HDVFAGRVAM VTGAAGGIGL ATAKAFREKG
AEIVILDLNQ ESLDKACEEI GPDTLAIACD VTDRSAIAEA FNRTCRMFGG IDIVVSNVGI
AIQGRIGDVD EALLRRSFEL NFFSHQTISQ HAVAVMRRQG IGGNLLYNVS KQAVNPGPDF
GAYGLPKAAT LFLVRQYALD HGRDGIRANG INADRIRTGL LNEEMIAKRS KARGLSPEEY
MAGNLLKLEV TAEDVAQAFV HLALEQKTTG SITTVDGGNI AAALR