Gene Information |
Locus tag | Haur_4751 |
Symbol | |
ID | 5736595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6058133 |
End bp | 6059950 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281916 |
Product | oligoendopeptidase F |
Protein accession | YP_001547510 |
Protein GI | 159901263 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR00181] oligoendopeptidase F |
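The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 6058133
END_BP = 6059950
GENE_LENGTH_BP = 1818
PROTEIN_LENGTH_AA = 605


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 6059950 - 6058133 + 1 = 1818 bp, and 1818 / 3 - 1 = 605 aa.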
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACCG CCGCAACAGT GCCAAGCCGT CAGGAAGTAG CCCTCGCCGA TACTTGGGAT GCAACCAAAG TTTTCGCTTC TGATGCTGAA TGGGAACAAG CACTTGATGC CTATGCCCAA TCGATCGCCG ACATCGAGCA ATACCAAGGC CGTTTGGCTG AAGGCCCAGC AACGCTTTTA GCCGCGCTGA ATTATTCCGA TCAGCTTAAC GAGCGGGTTG GCAAGCTCTA TATTTATGCC AGCCTGTTTT ATAGCGTCGA TACCACCGAC CAAGTAGCCA AAGCCAAAAT GGATCGGGTG TTGGGGGTTT ATTCGCGCAC CATGGCTAGC TCGGCGTTTA TCGAGCCAGA AATTATTGCG ATTGGCTTTG CAACGCTCAA GCAATGGCAA GCCGAAAACG CCGATTTGGC TCGCTATGGT CAATATTTTC ATGATGTTGA GCGGCGGCAA GCTCATGTGC GTTCAGCCGA AGTTGAACAA GTATTGGGCT TGGTCAGCGA TGCGTTCGAT TCTGCCAGTT CAACCCATCG CATTTTGACC GATGCTGATT TGGCCTATCC CGCAGCCCAT AGCACGAATG GGCTTGAATT AGAGCTAACT CAAGGCACAA TTGGCCTGTT GGTCACCGAC CCTGATCGGG AGGTGCGCCG CACAGCCTTT GAAAACTACA GCGATGCCCA TCTGGCCAAC AAAAATACCA TGGCCAATTG TTTGGCGACT TGTGTCAAGC AGCATGTCTT CAATGCTCGC GTGCGCAACT ATAGCTCAGC TTTAGCCGCC GCCCTTGGCG AAAACAACAT CCCAACCGAT GTCTTTCATC AATTGGTTGC CACGGTGCGC AAAAACTACC CAACTTGGCA TCGCTACTGG CGCTTACGCC GTCAAGCCTT GGGCTATGAT CAATTGCATG TCTACGACAT CAAAGCGCCA TTAACCACCA AAAAGCTTGA AATCCCTTAC ACTGAAGCCG TCGATTGGAT TATTACGGGG ATGCAGCCGT TGGGCGAAGG CTACACCAGC GTGGCCAAAC GTGGCCTGTT AGAAGATCGC TGGGTTGATA TTTACCCTAA CAAAGGCAAG AGTGCTGGGG CATTTTCGAG TGGTTGGAAG GGCACGCCAC CTTATATCTT GATGAACTAC AACAACGATA TTTTTGGGAT GAGCACCTTG GCCCACGAGC TTGGTCACTC GATGCACTCG TATTTGACCT GGCAAAACCA GCCAACAATT TATAGCAACT ATGGTTTGTT TGTGGCCGAA GTAGCTTCAA ACTTCAATCA AGCTCTAGTT CGCAACCACC TGTTTAATAC CAACCAAGAC CCCGATTTCC AAATTGCCTT GATCGAAGAA GCCATGAGCA ATTTCCATCG CTATTTCTTC ATCATGCCAA TTTTGGCCCA ATTTGAGCTA GAAATCCACG AGCGTGGTCA GCGTGGCGAA CCATTAACCG CTGCAACCCT CAACGGCATC ATGTTCGATC TGTTCCGTGA AGGCTATGGC GAGGAAGTTG TGCCCGATGC TGATCGTATG GGGATTACCT GGGCAACCTT CTCGGGCCAT ATGTATGCCA ACTTCTATGT CTATCAATAT GCCACGGGGA TTGCCGCCGC TCATGCGTTG GCCGAGGGGG TATTGGCGGG CAAGCCCGAT GCCCAAGCCA ACTATTTGGC CTTCTTGAGT GCTGGTAGCT CGTTAGCACC ATTGGATGCG CTGAAATTGG CTGGGGTCGA TATGACTTCC GCCGAGCCAG TTGAAGCCGC CTTCCGCGTG CTGGCCAGCT ATGTTGATCG CCTAGAACAA 
TTGCTGGGCA ATCAATAA
|
Protein sequence | MTTAATVPSR QEVALADTWD ATKVFASDAE WEQALDAYAQ SIADIEQYQG RLAEGPATLL AALNYSDQLN ERVGKLYIYA SLFYSVDTTD QVAKAKMDRV LGVYSRTMAS SAFIEPEIIA IGFATLKQWQ AENADLARYG QYFHDVERRQ AHVRSAEVEQ VLGLVSDAFD SASSTHRILT DADLAYPAAH STNGLELELT QGTIGLLVTD PDREVRRTAF ENYSDAHLAN KNTMANCLAT CVKQHVFNAR VRNYSSALAA ALGENNIPTD VFHQLVATVR KNYPTWHRYW RLRRQALGYD QLHVYDIKAP LTTKKLEIPY TEAVDWIITG MQPLGEGYTS VAKRGLLEDR WVDIYPNKGK SAGAFSSGWK GTPPYILMNY NNDIFGMSTL AHELGHSMHS YLTWQNQPTI YSNYGLFVAE VASNFNQALV RNHLFNTNQD PDFQIALIEE AMSNFHRYFF IMPILAQFEL EIHERGQRGE PLTAATLNGI MFDLFREGYG EEVVPDADRM GITWATFSGH MYANFYVYQY ATGIAAAHAL AEGVLAGKPD AQANYLAFLS AGSSLAPLDA LKLAGVDMTS AEPVEAAFRV LASYVDRLEQ LLGNQ
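The sequences are displayed in space-separated blocks of ten characters, so they need normalizing before any computation. The following is a minimal sketch of how the displayed gene sequence might be cleaned and checked; the helper names are hypothetical, and only the first and last displayed blocks are used here rather than the full sequence. Although the strand is listed as "-", the displayed sequence begins with ATG and ends with TAA, so it appears to be given in coding orientation already.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = ("TAA", "TAG", "TGA")


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the display."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    first_block = clean_sequence("ATGACAACCG")       # first displayed block
    tail = clean_sequence("TTGCTGGGCA ATCAATAA")     # last displayed blocks
    print(gc_percent(first_block))                   # GC of the first block only
    print(ends_with_stop(tail))
```

Applying `gc_percent` to the full cleaned gene sequence should give a value that can be compared with the 51% GC content listed in the Gene Information table.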
|
| |