Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3111 |
Symbol | |
ID | 5734983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3924733 |
End bp | 3926151 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280255 |
Product | carboxyl-terminal protease |
Protein accession | YP_001545877 |
Protein GI | 159899630 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000358486 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAT TCCAGCCCAC CTCGGGCGGT AGCTCAACAT CATCAGCAAA GCTATGGGTG GTGTTGAGTG GGATTGTTGG AGTCTTGTTG GTGGTCGCGA TTGCCCTTGG GGCTGGCTAT TATTGGGGTA GCTCATCGAA AGAGCAGACG ATGACGGCGG CGAATCAAGC TTTAGCCACC GAAACCGCCC AAATTATGCA AGCAACCCAA CAGGCGCTGC CCCCAGCCAA TGCCGATGAA AACTTTCAAA CCTTTTGGGA AGTTTGGAAT CTGGTCAACA AAGAGTTTTA TCACACCGAG CCAATCGACG AAAAACAAAT GATGTATGGC GCAATTCGCG GCATGCTCCA ATCGCTTGGC GATGATTTTA CTGGGTTCCA AGAACCCGAA GCCGCCGAAC GCTCGCGCGA GGATATGCGC GGCAATTTCG AGGGCATCGG AGCCTATGTC GAGTATAAAG ATGGCCAGAT CCTAATTGTT TCGCCAATTG AGGGTTCGCC TGCTGAAAAA GCCAATGTGC GAGCTGGCGA TATTGTGGTC GCGGTCGATG GCAAGCAAAT TAGTGAAGTC ATCGAGAATC TTGAACGCGA TCAAGCGCTT GCAGAAGCCA TTAAGCTGAT TCGTGGCCCC AAAGGTTCGC AAGTCGTGAT TACGGTCTAT CGTACCAGCG AAGAAAAGCA AATCGATATT ACGATTATAC GCGATACGAT TCCGTTGATC AGCGTGCGCT CAAGCATGAT TGGCGATATT GGCTACATTC AATTGAGCGA ATTCAAGCAA ACATCCTACG ATGAATTAGA CCAAGCAATT GCCAAACTCA AAACCAATAA CCCTAAGGCA ATTATTTTTG ATTTGCGTAA CAATCCAGGC GGTTATGTCA ATCAAGCTCA AAATGTACTT GGACGCTTTA CCAAAGATGG GGTAACCCAC TATCAAGAAA ATAGCGATGG TACGCAAAAG GAATATCGAA CTTTGCAGCA AGGCGATGCC CAAGAATTAT TTGATCTCCC AGTTGTGGTC TTGGTAAATG GTGGCTCAGC CAGCGCCTCG GAAATCGTCT CTGGTGCGAT GCAAGATACC AAACGCGCAA CCCTGATTGG GGAAAAGACC TTTGGCAAGG GTTCGGTCCA AAGTGTGCAT ACCCTGTCGG ATAAATCGGA AGCGCGGATT ACGATTGCCC ATTGGCTTAC TCCCAACAAA CGGGCAATTC ATACGCTGGG GATTACCCCC GATTATGTTG TGCCGTTCTC GGATGATGCA ACCCAATATC CAATTGAATG TATTTTGAAT CGCACACCTG CCGATGGGGC AACCAGTTGT GCTGATTCAC AATTGTTCTG GGCGCTAAAG TTCTTGAACG AACAACAAAC CCCACCGCCA CCGCCAACCC CAACGATTAC ACCAACCCCT GGCAAATAG
|
Protein sequence | MSEFQPTSGG SSTSSAKLWV VLSGIVGVLL VVAIALGAGY YWGSSSKEQT MTAANQALAT ETAQIMQATQ QALPPANADE NFQTFWEVWN LVNKEFYHTE PIDEKQMMYG AIRGMLQSLG DDFTGFQEPE AAERSREDMR GNFEGIGAYV EYKDGQILIV SPIEGSPAEK ANVRAGDIVV AVDGKQISEV IENLERDQAL AEAIKLIRGP KGSQVVITVY RTSEEKQIDI TIIRDTIPLI SVRSSMIGDI GYIQLSEFKQ TSYDELDQAI AKLKTNNPKA IIFDLRNNPG GYVNQAQNVL GRFTKDGVTH YQENSDGTQK EYRTLQQGDA QELFDLPVVV LVNGGSASAS EIVSGAMQDT KRATLIGEKT FGKGSVQSVH TLSDKSEARI TIAHWLTPNK RAIHTLGITP DYVVPFSDDA TQYPIECILN RTPADGATSC ADSQLFWALK FLNEQQTPPP PPTPTITPTP GK
|
| |