Gene Paes_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1044 
Symbol 
ID6459935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1146709 
End bp1147935 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content53% 
IMG OID642725044 
Productamidohydrolase 
Protein accessionYP_002015730 
Protein GI194333870 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value2.10114e-08 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.94036e-06 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTCAGAAA TTTCACGCGG AACCAGAGAC CGGATCGGCA GTCGGGCAGA TGAGTTATAT 
CCTCTTGTCC GCGATATCCG GCGCGATATT CATCGTCATC CGGAGCTTTC GTTTCAGGAG
TTCAGGACAA CGGCTCTTGT CAGGGATTAC CTGGAAAATC TCGGCTTTGA ATTCGCGCCC
CGTTACCTGG AAACCGGCGT CGTGGCGCTG CTGCGATCAC TGAACCCTTC AGCGCAGCAC
GAGAGGGTGG TGGTTTTGAG GGCGGATATC GATGCTCTTC CTTTGCAGGA GGAAAATATA
TCTGATTTCT GTTCGGGTGA GGCTGGATGC ATGCATGCAT GCGGCCATGA TATGCATACG
GCTATTCTTC TCGGGACAGC ATCTCTTCTC AGTGAATTTC GTCATGAGCT CCCGGGCGAT
ATCCTTTTTG TTTTTCAGCC GGCAGAGGAA AAGGCACCCG GAGGGGCTAA GCCAATGATA
GAGGCAGGCC TGTTCAGGGA CTATACTCCC GCGATGATTT TTGCTCTTCA CTGTTTTCCG
CATATCCGCT CAGGCAATGT TGCGCTTCGG GAGGGTAGTC TGATGGCTGC TGCTGATGAA
CTCTACATTA CGGTGCATGG AGAGGGGGGG CATGCATCAG CGCCGCATAA AGCAGCTGAT
CCCATTCTTG CTTCCGCTCA TATCATTACC GCGCTTCAGC ATCTTGTCAG CAGGGTTTCT
TCGCCATATG AGCCTGCAGT CCTGACTATC AGCTCAATTT CCGGCGGGCA TGCAACAAAT
GTGATTCCAG AGAATGTTGT CATGTCCGGG ACCATGCGAA TCATGAATGA AGAACTTCGT
TCGACCTTTC ATCATCGCCT GAAGAAAACC GTTGAACAGG TTGCCGATGC TTTAGGGGTT
AGCGCTGAAC TTGATATTGT GCACGGCTAT CCGGTTCTGG TCAACGATGC CGCAGCTTTT
GGCCTGGCGC GCGATGCTGC TGAAGAGATG CTCGGCGCCT CACATGTTGA GGAAAGCGAG
CCATTGATGA CCGCTGAAGA TTTCGCATGG TATCTGCAGG AGTGCCCTGG CGCTTTCATT
CAGTTAGGGA CCGGACGAAA TGAAGATCGC AAAGGGGACC AGTTGCACTC ACCATACTTC
GATCCCGATG AAGCGGCCCT GAAGACGGGA ATGGAGGTCA TGAGCTATAC CGCGATAAAA
GCTCTTGCAC GTCTTGCCGG GGGGTGA
 
Protein sequence
MSEISRGTRD RIGSRADELY PLVRDIRRDI HRHPELSFQE FRTTALVRDY LENLGFEFAP 
RYLETGVVAL LRSLNPSAQH ERVVVLRADI DALPLQEENI SDFCSGEAGC MHACGHDMHT
AILLGTASLL SEFRHELPGD ILFVFQPAEE KAPGGAKPMI EAGLFRDYTP AMIFALHCFP
HIRSGNVALR EGSLMAAADE LYITVHGEGG HASAPHKAAD PILASAHIIT ALQHLVSRVS
SPYEPAVLTI SSISGGHATN VIPENVVMSG TMRIMNEELR STFHHRLKKT VEQVADALGV
SAELDIVHGY PVLVNDAAAF GLARDAAEEM LGASHVEESE PLMTAEDFAW YLQECPGAFI
QLGTGRNEDR KGDQLHSPYF DPDEAALKTG MEVMSYTAIK ALARLAGG