Gene Paes_1980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1980 
Symbol 
ID6459904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2172684 
End bp2174066 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content50% 
IMG OID642725965 
ProductPeptidase M23 
Protein accessionYP_002016639 
Protein GI194334779 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00339871 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCATG ACCACAGACC GCTCTTTTCC TCAAGAAAAG GGTTCTCGCA CAACAAAAAG 
TCATTCTACA TAAAAGCATT CACGATCATA TCGCTCTCTG TTCTGGGACT CGGCTATGGC
ATCAAGAGCA TAACCCCCGC TGCAGAAAAA CTCTACAGTT CGATCTTTGT CAACTATGAC
GATGAACTCG GCATCACCGA AGAGTCAGCT GTCGTCGAAA TCGAACAGGG AGAAGTCCCC
GAAAAAAACA GCATCGTCGA AAACAAGATC CAGCGAGGAG AATCCCTCTA TATTCTGCTC
ACAGCCAATG GCCTCTCCCC TGCCGAAGTC AATGAGGTAT CACAGCAGCT CAAAGGAAAA
TTTTCCATCA AAAGCCTTCG CCCGGGCCAA AGCTACCGGG TCGAAAAAGA TCCCGAGAAC
AGGTTTCTCA GCTTTTCCCT GCAGCAGAGC CTGATAAGCA CGCTTCACCT GGAAAAGAAT
CTGGAATCGG GAACATTTCA TGTCTGGCAG GAAATCATGG AATATGATAC CAGACTCGCT
TCGCTCACAG GAACCATTGA ATCAAATCTG TCGCTCGAAC TCCAGAAAAA CAAACGCTAT
AGCCTGATCG GCCAGCTCCA GAACCTCTTT GCTTCAAAAA TCAACTTCAG ACGTGATATC
CATCCGGGAA CAACCTACAA GATTCTCTAC GAAGAAAACT GGCTCGGAGA AGACTACGCC
AATAGCGGCA ACATCATGGC CGCAGAGATC TCCTTTAATG GCAATACCTA TACAGCCTAC
CGCTATACCG ATGCACAAGG AAAGACCGGC TACTACGATG AAAACGGCCG TTCTCTCAAT
AGCTTCTTCC TTGCAAAACC CTGCAACTAC TCAAGAATAT CAAGCAGTTT CGGCTACAGA
CGGCACCCCA TACTCGGAAG GAAACACTTT CACGGAGGCA TTGACCTTGC CGCTCCGACC
GGTACCCCGG TCTATGCAGT CGCTGACGGA AAAATTGTGT ACCGTGGCCG CAAAGGTGCT
GCAGGCAATA TGATCACCAT CAACCACAGC AACGGCTACT ACACCAAGTA CCTTCACCTG
AGCCGTTTTT CATCGAAACA CCCCTACGGC AGCAAGGTTC ATCAGGGAGA TATCATCGGC
TACGTCGGTT CAACCGGCCG ATCAACAGGG CCACATCTGG ACTTCCGGGT CATCAAAAAC
GGAAAACTGC AGAATCCTCT GACCGCGCTG AAGAGCTCCA GTTCGACACG AGGCGTCTCC
AAAGCCGAAA TGCAGAACTT CATGGCTCAG CTCAGCGTCT TCCGCGCCCA GCTGAACGAA
AGCAATGTCC TCGTAGCCAA TCTCTCGAAG AAAAGCATCG AGACATCAAC AGCTTTGAAC
TGA
 
Protein sequence
MNHDHRPLFS SRKGFSHNKK SFYIKAFTII SLSVLGLGYG IKSITPAAEK LYSSIFVNYD 
DELGITEESA VVEIEQGEVP EKNSIVENKI QRGESLYILL TANGLSPAEV NEVSQQLKGK
FSIKSLRPGQ SYRVEKDPEN RFLSFSLQQS LISTLHLEKN LESGTFHVWQ EIMEYDTRLA
SLTGTIESNL SLELQKNKRY SLIGQLQNLF ASKINFRRDI HPGTTYKILY EENWLGEDYA
NSGNIMAAEI SFNGNTYTAY RYTDAQGKTG YYDENGRSLN SFFLAKPCNY SRISSSFGYR
RHPILGRKHF HGGIDLAAPT GTPVYAVADG KIVYRGRKGA AGNMITINHS NGYYTKYLHL
SRFSSKHPYG SKVHQGDIIG YVGSTGRSTG PHLDFRVIKN GKLQNPLTAL KSSSSTRGVS
KAEMQNFMAQ LSVFRAQLNE SNVLVANLSK KSIETSTALN