Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3115 |
Symbol | |
ID | 5734987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3929497 |
End bp | 3931308 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280259 |
Product | oligoendopeptidase F |
Protein accession | YP_001545881 |
Protein GI | 159899634 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR00181] oligoendopeptidase F |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.111844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATGG TTGAAGAACA GGTACCAACC CGCGAGGAAG TAAGCGCCGA AGACACTTGG GATATTAGTA GTCTCTATGC AGACCAAGCG GCTTGGGAGG CTGATGTTGA ACGAATTAGC AGCGATTTGC TGCCAGCCTT GACCAATTTG CAAGGCACGC TCGCCAATGG TCCTGAGGCG TTGTTGGCAG TGTTTCAAGC CCAAGAAGCC CTTGGCATGG TGCTCGAACA AATTTATGTC TATGCCAGTT TACGAGCCGA TGAAGATACG GCCAACCAAC ATTACCAAGC CCTCGAAGAA CGGGCCACCG CCCTCTCGAT TAAGGCAAGC GCCGCTACCT CTTGGATTGA GCCAGAGCTT TTAGCCCTTT CCGATGAGCA AATTTTGGGC TATGTGAGCA GTTTGCCCGC CCTCGAACTT TATCGCCGCG CCTTAGAAGA GCAAATTCGT TTGCGCCAAC ACACCCGCTC TGGCGAAGTT GAAGAATTAT TGGCCCAAAC TGGCGAGATT AGCCGTGGCG CTCAAACCAC CTTCAACATG TTTAGCGATG CTGACCTCAA ATTCCCGCCG ATTGAAGATG AACAGGGCAA GCCGCTCGAA GTGACGATGG GCCGCTACGC AGTGTTGCTG GAAAACCCCA ACCAACGCAT TCGCCGCGAT ACTTTTATGA GCATTCACCG CACCTATCGC CAATTTCGTA ATATGTTGGC GGCCAATTAT GCGACCAATG TGCGCAGTAA TATTTTTTAT GCCAAAGCGC GGGGCTACGA TTCAGCCTTA GATGCCAGCC TAAAACCCAA AGAAATTCCT ATGAGCGTCT ACGATAATTT GATCAGCACG GTGCACGAGC ACTTGCCCAA ATTGCATCGT TATGGCGCAG TGCGCAAGCG CATTTTGGGG GTTGATAGCC TGCATGCCTA CGATTGGTTT GTGCCATTAA ACGGCGCAGC CCCAACCAAA ATCGACTTTG AACAAGGTGC TTCGTTGATT TTGAGCGCCT TGGAGCCACT TGGCGCTGAA TATAGCTCCA ACCTCGGTCA TGGGCTGGAA TCGCGCTGGG TTGACCGCTA CGAAAATAAA AATAAACGCT CAGGAGCCTA TTCATGGGGT TGTTACACCT CACAGCCCTT TATTTTGATG AACTACAAAA ATAACTTGAA TAGTCTCTTT ACCCTGGCCC ATGAGCTTGG TCACTCGATG CACTCGTTGA TGACCCGTAA ATATCAACCC TATACCTATG GCCATTACAC CTTGTTTGTG GCCGAAGTCG CTTCAACTTT AAACGAAGCT TTGCTGGCCG AATATATGCT CAAAACCAGC GATGACCCAG CCTTGCGCTT GCAATTGGTC ACCCAGCAAA TTGATGATAT TCGCGGCACG TTGTTGCGTC AAACCTTGTT TGCCGAGTTC GAGCGCGAAA CCCATCGCAT GGTCGAGCAA GGTGAAGCGC TAACTGCCGA TAACCTCAGT GCCTTGTATC GCCGCTTGAT CGAGCAATAC TATGGCCCCG AATTGGTCAT CGATGAAGAA TTGGATATTG AATGGGCACG GATTCCTCAC TTCTACCGCT CGTTCTATGT CTATCAATAT TCCACTGGCA TTTCAGCTGC CTTGGCCTTG GCCGATAAGA TTTTGACCGA AGGCGCTGGT GCTGCTGAAA ACTACGTCAA CTTCTTGCGA GGTGGTAATT CCAAATCATC AATCGATCTA CTCAAGGGTG CTGGGGTCGA TATGACCACC CCCGACCCAA TTCATCGAGC CATGAATCGC TTTGGCGATT TGGTGACCAA ACTCGATGAA TTAACCGCCT AA
|
Protein sequence | MTMVEEQVPT REEVSAEDTW DISSLYADQA AWEADVERIS SDLLPALTNL QGTLANGPEA LLAVFQAQEA LGMVLEQIYV YASLRADEDT ANQHYQALEE RATALSIKAS AATSWIEPEL LALSDEQILG YVSSLPALEL YRRALEEQIR LRQHTRSGEV EELLAQTGEI SRGAQTTFNM FSDADLKFPP IEDEQGKPLE VTMGRYAVLL ENPNQRIRRD TFMSIHRTYR QFRNMLAANY ATNVRSNIFY AKARGYDSAL DASLKPKEIP MSVYDNLIST VHEHLPKLHR YGAVRKRILG VDSLHAYDWF VPLNGAAPTK IDFEQGASLI LSALEPLGAE YSSNLGHGLE SRWVDRYENK NKRSGAYSWG CYTSQPFILM NYKNNLNSLF TLAHELGHSM HSLMTRKYQP YTYGHYTLFV AEVASTLNEA LLAEYMLKTS DDPALRLQLV TQQIDDIRGT LLRQTLFAEF ERETHRMVEQ GEALTADNLS ALYRRLIEQY YGPELVIDEE LDIEWARIPH FYRSFYVYQY STGISAALAL ADKILTEGAG AAENYVNFLR GGNSKSSIDL LKGAGVDMTT PDPIHRAMNR FGDLVTKLDE LTA
|
| |