Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1815 |
Symbol | |
ID | 5733673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2110732 |
End bp | 2112978 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278958 |
Product | peptidase C11 clostripain |
Protein accession | YP_001544586 |
Protein GI | 159898339 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02806] clostripain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00011349 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCAGA TTGCGCCGCG ACGTTGGCGT TTGACTCGTT TTAGCCTCAT TTTACTCTTG CTCTTAGCCG CTTGTGCCGA TCTCTCCGAA CCAACTCCGG TGGCAAAACG CACTCCGATT GCTGGCCAAG CTACGCCCGC CGCCCCAACT CGCACCCCTG CTGGCCGCGA TGATCAGAGT TGGCTGATTA TGCTCTACTC CGATGCCGAC GATGAGATTC TTGAAGAAGA TATGCTCAAC GACATCAACG AGGCTGAACT GGTTGGCTCA ACCGATCGGG TGCGGGTAGT AGCTCAGGTC GATCGCTATG ATGGCGGCTT CGATGGCGAT GGTGATTGGA CGAGCACCAA ACGCTTTTAC ATCGAACAAG ATGACGACCT TGAGCAAATG AACTCCAAGG AACTCGCCGA CCTTGGCGAA GTCAATATGG CCGATGCCGA TACCTTGACT GATTTTGTCA CATGGGCCGC CAAAACCTAT CCCTCGGACA AATATGTGTT AATTATGTCT GATCATGGGG CTGGCTGGCC GGGCGGCTGG AGCGACCCTG ACCCTTCAAC CACAGGTCGC CACGATATTC CGCTGGCCGA GAGCTTTGGC GATATGCTTT TTCTCATGGA AATGAGCGAG GCACTCGAAC ATATCATCGC TGAAACCAAT ATTGGCGAAT TTGAGTTAAT TGGCTTTGAT GCCTGCTTGA TGAGCCATGT TGAGGTCTAT AGTGCGATCG CCCCCTATGC TCGCTATGCT GTGGCCTCGC AAGAAGTCGA GCCATCGCTG GGCTGGGCTT ATGCTGCAAT TTTGGGGCGA TTAACCGATA GCCCTGAAAT CGATGGCGCT GAGCTTTCGC GAGCAATTGT CGATAGCTAT ATCGAGCAGG ATCAACAAAT TCTCGATGAT GATGCCCGCG CCAAATATGT TTCGCGCACC TACGATTTTG AGGGCAATGT TTCTGCTGAA GAAGTTTTAG AGCAAGAGCG CAAGGCCATC ACCCTGACGG CAATCGATTT GGGCAAATTG CCTGCGGTGA TCGATGCACT TGATCGTTTG GTGATTACTT TGGCCGAGGC CGAACGCAAA GACATTGCAG CGGCACGACG CTATACCCAA GCGTTTGAGA GCGTGTTTGA TAGCGATCAA CCTAAGCCCT ACATTGATCT TGGGCACTTT GCCCAATTGC TCAAGCAAAA AGTTAATGTG CCCAGCGTCA ACAAAGCCGC TGATGAGCTG ATTGCAGCGA TTGATCGCAG TTTGATCGAA GAGAAACATG GCGATGAAAA GGCTGGGGCA ACGGGCATTT CGATCCACTT TCCCAATTCC AAACTGTATA CCAGTGCCGA TGCTGGCTAC AAATCCTATA ACATGGTTGC CGTAAACTTC GTCAACGATT CGCTGTGGGA TGAATTTTTG GCCTTCCAAT ACGCCAAAAA GCCATTGCCA ACCACCATTG AGCAACCAAC CGCAACCGTC GAACGTCAGC CAACCCCAAC GCCTGAGCCA ATTGATGTGA CCGAGGTTGA AGCACCTGGC TCCGAGCCAA TTACGGTTGC TGCGATTGAA CTTTCAAGCA CAACCGCCAG TCTCGATCAG CCTGTTGTGC TCACCAGCAG CATCACTGGC GATAATATTG CCTTCGTTTA TATCTTCATC GGCTACTATG ATCAAGAGTC CGATTCAATT CAAGTTTTGG ATATGGACTA CCTTGACGCT GAACAAACCC GCGAAATCGG CGGAGTCTTT TATCCCGATT GGGGCGAGGC CTCGACGATC GATATTGAGT TTGAGTGGGA TACGCAAGTT TTTGCCATGC ACAACGAAAC CACCGCAAGT CTAGCCTTGT TTAGCCCCGA AGATTATGGA GCATCGCCCG AAGATGCCAC CTACACCGTC GAAGGCTTGT ACACCACTGC CAAAGGCAAA AAGACTCGAC GCGCGTTGTT GTTGTTCAGC AATGGCGAGT TGGTACAAGT GCTCGGGTAT ACTGGCAAGG AAGATACTGG TTCGTTGCGC GAAATCAACC CTAAGCGTGG CGATAAATTT GTGGTGCTCG ATACTTGGCT TGAAGAATCA CAACAAACTG GCGAGAGCGA ATTTGTCAAT TACGAAGGCG AAACCTTTGT GTTTGGCGAT GACAACTTTA CTTGGGAGCT TGAACCAGCT CCTGCTGGGA ATTACCTCGT TGGCTTCTTC GCTGAAGATT TTGATGGTAA TGTCTACTCA GCCTATGAAA CCTTGATTAT CGAGTAA
|
Protein sequence | MQQIAPRRWR LTRFSLILLL LLAACADLSE PTPVAKRTPI AGQATPAAPT RTPAGRDDQS WLIMLYSDAD DEILEEDMLN DINEAELVGS TDRVRVVAQV DRYDGGFDGD GDWTSTKRFY IEQDDDLEQM NSKELADLGE VNMADADTLT DFVTWAAKTY PSDKYVLIMS DHGAGWPGGW SDPDPSTTGR HDIPLAESFG DMLFLMEMSE ALEHIIAETN IGEFELIGFD ACLMSHVEVY SAIAPYARYA VASQEVEPSL GWAYAAILGR LTDSPEIDGA ELSRAIVDSY IEQDQQILDD DARAKYVSRT YDFEGNVSAE EVLEQERKAI TLTAIDLGKL PAVIDALDRL VITLAEAERK DIAAARRYTQ AFESVFDSDQ PKPYIDLGHF AQLLKQKVNV PSVNKAADEL IAAIDRSLIE EKHGDEKAGA TGISIHFPNS KLYTSADAGY KSYNMVAVNF VNDSLWDEFL AFQYAKKPLP TTIEQPTATV ERQPTPTPEP IDVTEVEAPG SEPITVAAIE LSSTTASLDQ PVVLTSSITG DNIAFVYIFI GYYDQESDSI QVLDMDYLDA EQTREIGGVF YPDWGEASTI DIEFEWDTQV FAMHNETTAS LALFSPEDYG ASPEDATYTV EGLYTTAKGK KTRRALLLFS NGELVQVLGY TGKEDTGSLR EINPKRGDKF VVLDTWLEES QQTGESEFVN YEGETFVFGD DNFTWELEPA PAGNYLVGFF AEDFDGNVYS AYETLIIE
|
| |