Gene Paes_1925 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1925 
Symbol 
ID6460035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2107475 
End bp2108719 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content51% 
IMG OID642725910 
Productpeptidase U32 
Protein accessionYP_002016584 
Protein GI194334724 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.984732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATACCT TAAAGAAACC GGAACTGATC GCTCCTGCCG GAGACAGAAC CTCGCTCATA 
ACTGCTCTGA ATGCAGGAGC TGACGCAATC TATTTTGGCG CAGAAAGCTG CAACATGCGC
GCCAGAAGCC GCAGTTTCAA CAGCCGGGAT TTCCATGATA TCGCATCACT CTGCCAGGAA
CATCAGGCCA GGTCCTATCT TGCGCTCAAC ACTCTCGTCT TCGATAACGA AATTGAGATG
GTCAACATGA CGGTCGCAGC AGCAAAAAAG GCCGGCATCG ATGCCGTCAT CTGCTGGGAT
GTGGCCGCTA TCAATGCCTG CAGAACTCAG AATATGCCGT TCCATCTCTC CACCCAGGCA
TCGGTCAGCA ACTACAGCGC AGTCTCGTTC TATGCCTCTC TCGGTGCAAA GATGATTGTG
CTTGCCCGCG AACTGACGAT TGAACAGGTA AACTCCATCA CCGAACAAAT TCGAAAGGAC
AAGCTCGATG TCGCTATCGA ATGCTTCGTT CACGGTGCAA TGTGCGTCGC TGTTTCAGGC
AGATGTTTTC TTTCGCAGGA TATTTTCGGT CGTTCCGCCA ACAGGGGCGC ATGCCTGCAG
CCCTGCCGCC GCCGTTACCG GATTATCGAT GAAGAAGAGG GCTTTGAGCT GGAAATCGGA
ACCGATACGG TAATGAGCCC CAAAGACCTC TGTGCGATCT CCTTGATGCC TCTGCTTATC
GACTCTGGAA TTACCGGTTT CAAGATAGAA GGACGCAACC GCAGTCCTGA ATATGTCCAT
ACCACGACAT CCTGCTACCG CCAGGCCATC GACTACATTG TAGCACACCG CCATGAAACA
TCATTCGAAG AAGATTATCA AGCAGTTGCC GACAATCTTA CCAGGCAGCT GAAAACAGTC
TACAACCGTG AGTTTTCGAG CGGATTTTAT TTCGGCAAGC CCTTCGATGC CTGGACAAGG
AACTACGGTT CGAGAGCGAC CGAAAAAAAG GTCTATCTCG GCACCGTTCT CAAGTACTAC
CCTCAAGCAG AAATCGCTGA AATCCTCATC CACTCGGAAG AACTGAAGCG GCATGACAAA
CTCTCGATAC AGGGCACGAC AACAGGGATT GTCATCCTCA ACGCTGAATC ATTTCACGCA
TCCGATAAAC CCGCAGATTC CGCCGAGAAA GGCGAAGTAG TCACCCTGCC CTGCAACCGA
AAAGTCAGAA AAAACGACAA AGTATATCTG CTTCGACCAA TCTGA
 
Protein sequence
MHTLKKPELI APAGDRTSLI TALNAGADAI YFGAESCNMR ARSRSFNSRD FHDIASLCQE 
HQARSYLALN TLVFDNEIEM VNMTVAAAKK AGIDAVICWD VAAINACRTQ NMPFHLSTQA
SVSNYSAVSF YASLGAKMIV LARELTIEQV NSITEQIRKD KLDVAIECFV HGAMCVAVSG
RCFLSQDIFG RSANRGACLQ PCRRRYRIID EEEGFELEIG TDTVMSPKDL CAISLMPLLI
DSGITGFKIE GRNRSPEYVH TTTSCYRQAI DYIVAHRHET SFEEDYQAVA DNLTRQLKTV
YNREFSSGFY FGKPFDAWTR NYGSRATEKK VYLGTVLKYY PQAEIAEILI HSEELKRHDK
LSIQGTTTGI VILNAESFHA SDKPADSAEK GEVVTLPCNR KVRKNDKVYL LRPI