Gene Paes_1939 details

Gene Information

Locus tag: Paes_1939
Symbol:
ID: 6460012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 2121022
End bp: 2122443
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 51%
IMG OID: 642725924
Product: metal dependent phosphohydrolase
Protein accession: YP_002016598
Protein GI: 194334738
COG category: [R] General function prediction only
COG ID: [COG1078] HD superfamily phosphohydrolases
TIGRFAM ID:
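
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


# Values copied from the record for Paes_1939.
length = gene_length_bp(2121022, 2122443)
print(length)                     # 1422
print(protein_length_aa(length))  # 473
```

Both results agree with the Gene Length (1422 bp) and Protein Length (473 aa) fields.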


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTCCG ACAGTTTTCT GTTCCGCTCG GACGGCGGCT TTATCCGCCT CCCCATATGG 
GGTCACATTC CGCTCAATAC CCCGCTCAAA AAAATTCTTT CCCATCCGAC CTTTCTTCGC
CTGAAAGGCA TCCGCCAGCT CTCGTTTTCA CAGCAGGTCT ACCCTGGAGC CACGCATACC
CGTTTTGAGC ACTCGATCGG GGTCTACCAT CTCATGAAAC TGATTCTTCA GAGAATCCAT
ACCAACCCGC TTGCCGAAAG CCTTCAGAAC GACCGTTTTT TCTTCGACGA TCACTCCTGC
AGACTGCTGC TCTCGGCCGC CCTGCTTCAC GACATCGGGC ACTACCCTCA TGCGCATGTT
CTCGAAGAAC AGGCTCCTGT CTTCCGGGGC AAACCGGTCT TCACCAGACA TGAGTCGCTG
GTCGACCGTT TTCTTTTCGA AACCAGTGAT CATTTCCCAT CAATTGCCGA CACGCTCCAC
GATGAGTGGA AGGTCGACCC GCATGAGGTA GCGGCACTGA TTCGTGGAGA GAGTGGCAAC
ACCTACAAAA AACTGATCAG CGGGACGCTT GACCCCGATA AAATGGACTA CCTCATGCGC
GACGCGCACC ACTGCAATAT CCCCTACGGA AACATCGATA TAGAACGATT GATCGAATCG
TTCGTCCCTG ACCCGGAACG ATGCCGTTTT GCTATTACAG AAAAAGGGAT AGCGCCGCTC
GAAAGTCTGC TCTTCGCCAA GTACATGATG ATGCGCAATG TCTACTGGCA CCATACAAGC
CGGACATTCT CCGTCATGCT GCGCAGGTTT CTTCAGGACA ATATCGATAC CTGCATGATC
ACGCCGCATA CGCTCACCGA GCTTTTTTAC AGCAATTCAG ATGATCGTGT GCTCTACGAT
CTCGAACACC TCCTGCCCTC CGAAAAAAAT CCGGCAGGAT CACTCCTTGA ACGTATTCAG
CAGCGAAAAA TTTATAAACG CGCCATCATT CGCACCCCGT ATCTGAATGG GGCAAAAAAG
CCTATCGACT GGATGATGGC CTACGCCACC GATCATGAAC GACGCAAAGA GCAGGAGATC
GCGCTCTGCA CCATGATCGC CAAACGCCAC AACATCGATC TCTGCGGACA TGAGATTCTC
ATCGACCCGC CATCACTGAA AGATATTTTC GACTACGATG ATCTGAGAGA ACTCTGCGTC
TTTCCAACGA AACAGGAACA TCTCCAAAGC CCCGGTAATC ATCAGAAAAA TTATATCAGC
TTCGACGAGT TCGGCGAATC GGTCTTCCGC TCGGATTTTA TCCTCGCGTT TGAACGCTAC
ACAAAAAAAT TCCGTATCGT CTGTCGAGAA GACCTGACCC CCCTTGTCCG TCAACATGAG
GAAGAGATCA TCGGGATGCT TGAAGCAAAG GGGCCGACCT GA
 
Protein sequence
MISDSFLFRS DGGFIRLPIW GHIPLNTPLK KILSHPTFLR LKGIRQLSFS QQVYPGATHT 
RFEHSIGVYH LMKLILQRIH TNPLAESLQN DRFFFDDHSC RLLLSAALLH DIGHYPHAHV
LEEQAPVFRG KPVFTRHESL VDRFLFETSD HFPSIADTLH DEWKVDPHEV AALIRGESGN
TYKKLISGTL DPDKMDYLMR DAHHCNIPYG NIDIERLIES FVPDPERCRF AITEKGIAPL
ESLLFAKYMM MRNVYWHHTS RTFSVMLRRF LQDNIDTCMI TPHTLTELFY SNSDDRVLYD
LEHLLPSEKN PAGSLLERIQ QRKIYKRAII RTPYLNGAKK PIDWMMAYAT DHERRKEQEI
ALCTMIAKRH NIDLCGHEIL IDPPSLKDIF DYDDLRELCV FPTKQEHLQS PGNHQKNYIS
FDEFGESVFR SDFILAFERY TKKFRIVCRE DLTPLVRQHE EEIIGMLEAK GPT
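
The protein sequence is the codon-by-codon translation of the gene sequence. As a minimal sketch, the code below translates the first 30 bases of the gene and compares the result with the start of the protein. The helper names (`CODON_TABLE`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may initiate translation, so alternative start codons are not handled here.

```python
from itertools import product

# Standard amino-acid assignments (shared by translation table 11),
# listed in TCAG order for the first, second, and third codon base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spacing used in the record
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence, copied from the record.
gene_start = "ATGATCTCCG ACAGTTTTCT GTTCCGCTCG"
print(translate(gene_start))  # MISDSFLFRS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would translate the whole record, ending at the terminal TGA stop codon.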