Gene Paes_0472 details

Gene Information

Locus tag: Paes_0472
Symbol:
ID: 6460057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 506174
End bp: 507238
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 52%
IMG OID: 642724471
Product: protein of unknown function DUF900 hydrolase family protein
Protein accession: YP_002015175
Protein GI: 194333315
COG category: [S] Function unknown
COG ID: [COG4782] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
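
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 506174
end_bp = 507238
reported_gene_length_bp = 1065
reported_protein_length_aa = 354

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, leftover_bases, protein_length_aa)
# 1065 355 0 354
```

Under these assumptions the computed values agree with the reported 1065 bp and 354 aa.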


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 4.024e-10
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAT CCCGTTCAGG AGCGATAATC TTCTGTGTTC TCAGCCTGTT ATTATTCAAC 
GGCTGCTCTT CCGGCAAAGA GGCGCTTGAA CAGTTTTCAG GTCGCAGGGT TCAGCTGTTT
TTTGCTACGG ATCGCAACGA TACAGGACGT GCGGAGCCCG ATTCACGTTT CGGGAGTGAG
CGCGATTCAC TGCACTATGG AACCACAGTA ATCTCTTTTC CTGCCGATCA CAGGATTGGG
CAGCTTGAAT CAGCATTACT GGCCGATGAT GCACTTGAAC ATGTGCTGCT CAAGGAGGTG
AACAGTATGT CGGAGGAGCG TTTTTTTGGT CTTCTCACCC AGTCGGTCAA CGCTTATGAC
AAGCAGGAAT TGCTTGTGTT TGTGCACGGC TTTAATATGA GCTTTGCAAA GGCTTCTCGT
CGTTTCGGAC AGATAGTTTC AGATCTTGGC TACAGGGGAT GTCCTGTCTA CTTCAGCTGG
CCCTCAAGGG GGGCGATCAA CAGGTATGCG GCAGATAAAA CCTGTGTCGA GTGGTCTACG
CTGAACCTGG AGTGTTTTCT CGAACAGCTT GCAGTGCGTT CGGGCTCCCG GAAGATCATC
CTGACTGCGC ACAGTATGGG AGGCAGGGCT CTGACCTATG CATTCAGCTC ACTTCTGCAG
CGGCGGCCGG ATCTTGCAGA TCGTTTCGGA GCTCTGCTGC TCTATGCGCC GGATATCGAC
AGCGGTGTTT TCAGGCGTGA TATAGCGCCG GTCCTTTCAG CTTCGGGTGT ACAGGTTACA
CTCTATGTTT CCGGCAGAGA TCGGGCGCTG AAGGTATCAA GGCGTCTTAA TGGCTATGAG
CGGGTGGGTT ATGTGGAGGA CCGTCCGCTG ATTGAGAACG GGGTAGAAAC CATTGACGTG
GGTAATGTGA AGGCAGGATT TTCAGGCCAC TCCTACTATC ATCAATCACG GCCTGTGCTT
TCTGATATGT TCTACCTTAT CAATGAAGGG TTGCCGGCTG ACCAGCGTTT TTCTCTTGAA
CCGGTCGATA CTTCAGATGG CCGTTACTGG CGGTTCAGGC GGTGA
 
Protein sequence
MKKSRSGAII FCVLSLLLFN GCSSGKEALE QFSGRRVQLF FATDRNDTGR AEPDSRFGSE 
RDSLHYGTTV ISFPADHRIG QLESALLADD ALEHVLLKEV NSMSEERFFG LLTQSVNAYD
KQELLVFVHG FNMSFAKASR RFGQIVSDLG YRGCPVYFSW PSRGAINRYA ADKTCVEWST
LNLECFLEQL AVRSGSRKII LTAHSMGGRA LTYAFSSLLQ RRPDLADRFG ALLLYAPDID
SGVFRRDIAP VLSASGVQVT LYVSGRDRAL KVSRRLNGYE RVGYVEDRPL IENGVETIDV
GNVKAGFSGH SYYHQSRPVL SDMFYLINEG LPADQRFSLE PVDTSDGRYW RFRR
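
The gene and protein sequences above can be related to each other, and to the reported GC content, with a short script. The following is a minimal sketch, not the method used by the database. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in the permitted start codons), and it assumes the sequence contains only A, C, G, and T. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base T, C, A, G; then second; then third.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Drop the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence listed above.
first_block = clean_sequence("ATGAAAAAAT CCCGTTCAGG AGCGATAATC")
last_block = clean_sequence("CGGTTCAGGC GGTGA")
print(translate(first_block))  # MKKSRSGAII
print(translate(last_block))   # RFRR
```

Passing the full gene sequence through `clean_sequence` and then `translate` or `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content.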