Gene Paes_0994 details

Gene Information

Locus tag: Paes_0994
Symbol:
ID: 6458701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1091955
End bp: 1093016
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 52%
IMG OID: 642724993
Product: Cytochrome-c peroxidase
Protein accession: YP_002015680
Protein GI: 194333820
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
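
The start, end, gene length and protein length fields above are mutually consistent, and the relationship can be sketched in a few lines of Python. This is only an illustrative check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming one trailing stop codon is not counted."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1091955, 1093016)
print(length)                     # 1062
print(protein_length_aa(length))  # 353
```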


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTTA CTACTTTCCT CGGTGCCCTG ACTACGCTTG CAGTCACCAC CTCGTGTACT 
CCTGACACAG AGCAGCAGAA CAACTCAAAA AGCGCTCAAA AGCCTGCCGC AGAGCAAGAG
ATTTCCGTCA TGCGAAACGA ACCGGTAAAG CCACTTGAAC CGGCGATAAT TACCGATACT
GCAATGGTAG AACTTGGCAA AAAACTCTAT TTCGACCCAA GGCTATCGCT CTCCGGATTT
ATCTCCTGTA ACTCATGTCA CAACCTCAGT ATGGGAGGAA GTGACAACTT GAAAAGCTCG
ATCGGGCATA AATGGAGTAG AGGACCTATC AACTCGCCGA CCGTCCTGAA CTCGAACCTG
AACCTGGCAC AATTCTGGGA TGGAAGAGCA AAAGACCTTA AAGAACAGGC AGGGGGGCCG
ATAGCCAATC CTGGAGAGAT GGCATTCACC CATGAACTTG CCGTCGAGCT CTTGCAATCG
ATTCCCGGTT ACGTCGATGA ATTTCACCAG GTGTTCGCCA CCACCAGCAT TGACATCGAC
CAGGTCACGA CAGCCATTGC AGCTTTTGAA GAAACCCTCG TCACACCCGA TTCGCGATTC
GACCTCTGGC TTAAAGGCGA TGACAGCGCC ATCAATGAAA CGGAACTGAA AGGATACGCG
CTCTTCAAGT CAAGCGGATG TAGCGCTTGC CATAACGGAC CGGCGCTTGG AGGAAACTCT
TTCCAGAAAA TGGGACTTGT TGCCCCATAC AAGGCAACAA GTCCTGTCGA AGGACGTTCG
GCAGTAACCG GAAAGGATGC CGACCGGTTC TCCTTCAAAG TCCCAACCCT GCGAAATGTC
GAGCTGACCT ATCCGTATTT TCATGACGGC GAGGCGGAAA CACTGGCAGA AGCAGTCGAA
ATCATGGGCA GAATTCAGCT GGGAAGAACC TTCACCGAGG AGGAAAATGA GCAGATTGTC
GCGTTTCTGA AAACCCTGAC CGGCACCCAG CCCAGGATGG AACTACCACT TCTCCCGCCA
TCATCAGATA CAACCCCGAG ACCCGATCCG TTCAGCGACT GA
 
Protein sequence
MKLTTFLGAL TTLAVTTSCT PDTEQQNNSK SAQKPAAEQE ISVMRNEPVK PLEPAIITDT 
AMVELGKKLY FDPRLSLSGF ISCNSCHNLS MGGSDNLKSS IGHKWSRGPI NSPTVLNSNL
NLAQFWDGRA KDLKEQAGGP IANPGEMAFT HELAVELLQS IPGYVDEFHQ VFATTSIDID
QVTTAIAAFE ETLVTPDSRF DLWLKGDDSA INETELKGYA LFKSSGCSAC HNGPALGGNS
FQKMGLVAPY KATSPVEGRS AVTGKDADRF SFKVPTLRNV ELTYPYFHDG EAETLAEAVE
IMGRIQLGRT FTEEENEQIV AFLKTLTGTQ PRMELPLLPP SSDTTPRPDP FSD
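
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration and not the tool used to build this record. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares, and it ignores the alternative start codons that table 11 permits, since this gene starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 is assumed to use the
# same amino acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listed above.
first_line = ("ATGAAGCTTA CTACTTTCCT CGGTGCCCTG "
              "ACTACGCTTG CAGTCACCAC CTCGTGTACT")
print(translate(first_line))  # MKLTTFLGALTTLAVTTSCT
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_fraction` applied to it can be compared with the GC content field.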