Gene Information |
Locus tag | Haur_3015 |
Symbol | |
ID | 5734902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3806687 |
End bp | 3809194 |
Gene Length | 2508 bp |
Protein Length | 835 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280159 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001545781 |
Protein GI | 159899534 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
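
The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3806687, 3809194)
print(length)                     # 2508
print(protein_length_aa(length))  # 835
```

Both values agree with the Gene Length (2508 bp) and Protein Length (835 aa) fields in the record.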
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000302047 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTAT TTCATGGTCT TTTACTGATG AGCTTAGTCT TGCTAAGCTT CGGCGGTGCT GCTAGCAAAC CACCTAGCAC CACAATTCCA CCCAAGTTGT ATTTACAACG CGGCACGATC GATCTCAAAG TGGTTAATCA AGCCAACCAG GCTGATCCAT TATTGCAGAC AGTTGGGGGG TATGCGGTTA TTCAATTTAG TGGGCCTGTG TTGCTCAAGC AGCGGCAAGC ACTTGAAGCT ACGGGGTTGA GCATTATTGA GTATCTCCCT GATTACGCCT ATTTAGTTCG TGGCTCAGCT GCCCAACAAG CTGCTGCCAG CCGAATTGAT GGTTTTTATG CGCGTGGCGA TTGGACGTTG GCCGATAAGT TTCATCCTGG GTTGTTGAAA CTGATGCGCA CTGGTGCGTA TCAAGGTTTG GCGTTGCAGA CGATTGGTTG GGATAACCAA CTGACCACTG CTGAACAAGC AGTCAAAGCC CAAGGCTTGA AGCTTGATGC CATTAGCACG ATCGATCAAC TGATTCAGTT GGCCCAAATT AATGAAGTGC GCTGGATCGA AGTGGCGAGT ACGCCGAAAT TATTCGACCA ATATGCGCGG CAAGTTCAGC AAGTTGAGCC TGTTTGGACT GATCGGCAGT TGTATGGCCA AAATCAAATT GTGGCCTACA CTGACACAGG TTTGGATACT GGCTCCTTGA CAACCCTCAA CAACGATTTT ACCAATCGGA TTCTTGCGAC CCAAGTGCTC TCGGCTGGGG GGCATTGGGA TGATAACCAT GGCCATGGCA CGCACGTTGC TGGCTCGATT GCTGGTAACG GCGCACTTTC TGGCTCAAAC CCAGCTACCC ATACCTATAC CAACTCGATG GCAGGGATTG CGCCTGAGGC CAAGTTGGTT GTCCAAGCGT TTGAAGCAAC TGCGACCGGC GATATTATTG GCTTGCCAAC CGACCTCTAC CCAATGTATC AGCAAGCCTA TGATGCTGGA GCACGGATTC ATAGCAATAG CTGGGGCGAT GCAACCGGGC CAGTCAGTGA TACAGAAGCT GCATTTGGGG GCTACCCCTA TAATGCGCAA CGCACCGACC AGTTTTTGTG GGAACACCCC GATTATACAA TGTTGTTTGG GGCTGGTAAT AGTGGGGTCG ATGGAACCCC GAGCCAAGCA ATCTTTTGTA CTGGTGGTAA TGGCGTTGTT GACCCGGACT CGTTGCTCGC ACCTGGGACT GCCAAGAATG TGATTACGGT TGGGGCTAGT GAAAGCCCGC GCCCAACGGG TGGTTATACG GGTGTGCCTT GGCTTTTATT GAGTTTTTGC TTTGCCACCG CGCCAATTAA CACGGATACT CTTTCTGATG ATGCTAATGG GATAGCGGCA TTCTCATCAC GTGGGCCAAC TGATGATGGC CGGATCAAGC CTGATTTGGT TGCCCCTGGC ACGAATATTC TTTCAACTCG CTCATATGGC AGTGGGGCTG GTGCCTTATG GGGTGTTCAT GAAACCAATG CCAATTACCT TTATTCGGGT GGTACGTCGA TGTCCACTCC GTTGGTTGCT GGTACGGTTG CCCTCATCCG CCAGTGGCTG GGTATCCAAG GCTTGCCTAA TCCTAGTGCC GCAGTCATCA AATCAATTGT TCTGAATACG ACCGTTGATA TTGCGCCCGG CCAATATGGT ACTGGTGCAA CTCAAGAAAT TCCTTACAAC CGCCCCAATA GTGTGGCTGG TTGGGGGCGT AGCAATTTGA GTTTTATTAC CAAACCAGCG CCCTATCATT TGTGGGTTGC CGATCAGACG 
ACTGGCTTGA ATACAGGCCA GATGGTAAGC TATAACCATA CTGCCAGCCA ACCCCTAACC GTGTTGACCA ATACCCAACC GCTGCGAGTT ATGCTCAACT GGACTGATCC ACCAGCTTCG TTGGCAGCCA CCCAACAATT GGTTAACGAT CTTGATTTGG TGTTGATTGG GCCTGATGGT ACGCGCTATT ATGGCAATAA TCAGAGCACT GGCGATCGCA CGAATAATAC CGAAGGTGTG ATCATTAATA ATCCCCAAAT TGGTGCATAT CAGATTGAAG TTACTGCCCA TAATGTACCT ATTTCTAGCC AAGCCTATGG CTTGTCAGTC GCTGGCCCAT TGCGTGAAGC TACTGGCGGC GGTACGCCAA CTCCAACCCC AACCGCTATA GCAACCGCAA CAAATACGCC AACCAACACG GCGACCAATA CACCAACCAA CACGGCGACG AATACGCCAA CCAACACGCC AACCAATACG CCAACCAATA CACCAACCAA TACACCAACC AACACGCCAA CCAACACGGC GACCAATACA CCAACCAACA CGGCGACGAA TACGCCAACC AACACGCCAA CCAATACGCC AACCAACACG GCGACCAATA CACCAACCAA CTCGCCAACC CCCACCAATC TGCCAACTGT GACCACGACA GCGGTTACAA ACGAGTATGA TGTTTGGATA CCATGGGCGA GCAAATAA
|
Protein sequence | MRLFHGLLLM SLVLLSFGGA ASKPPSTTIP PKLYLQRGTI DLKVVNQANQ ADPLLQTVGG YAVIQFSGPV LLKQRQALEA TGLSIIEYLP DYAYLVRGSA AQQAAASRID GFYARGDWTL ADKFHPGLLK LMRTGAYQGL ALQTIGWDNQ LTTAEQAVKA QGLKLDAIST IDQLIQLAQI NEVRWIEVAS TPKLFDQYAR QVQQVEPVWT DRQLYGQNQI VAYTDTGLDT GSLTTLNNDF TNRILATQVL SAGGHWDDNH GHGTHVAGSI AGNGALSGSN PATHTYTNSM AGIAPEAKLV VQAFEATATG DIIGLPTDLY PMYQQAYDAG ARIHSNSWGD ATGPVSDTEA AFGGYPYNAQ RTDQFLWEHP DYTMLFGAGN SGVDGTPSQA IFCTGGNGVV DPDSLLAPGT AKNVITVGAS ESPRPTGGYT GVPWLLLSFC FATAPINTDT LSDDANGIAA FSSRGPTDDG RIKPDLVAPG TNILSTRSYG SGAGALWGVH ETNANYLYSG GTSMSTPLVA GTVALIRQWL GIQGLPNPSA AVIKSIVLNT TVDIAPGQYG TGATQEIPYN RPNSVAGWGR SNLSFITKPA PYHLWVADQT TGLNTGQMVS YNHTASQPLT VLTNTQPLRV MLNWTDPPAS LAATQQLVND LDLVLIGPDG TRYYGNNQST GDRTNNTEGV IINNPQIGAY QIEVTAHNVP ISSQAYGLSV AGPLREATGG GTPTPTPTAI ATATNTPTNT ATNTPTNTAT NTPTNTPTNT PTNTPTNTPT NTPTNTATNT PTNTATNTPT NTPTNTPTNT ATNTPTNSPT PTNLPTVTTT AVTNEYDVWI PWASK
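
As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below translates a nucleotide string and computes its GC percentage. It is a minimal sketch under stated assumptions: the sequence is read as the coding strand in the 5' to 3' direction (the listed sequence begins with ATG even though the gene lies on the minus strand), and codons are mapped with the standard amino-acid assignments, which translation table 11 shares with the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. All names in the code are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence in this record.
prefix = "ATGCGATTAT TTCATGGTCT TTTACTGATG"
print(translate(prefix))  # MRLFHGLLLM
print(gc_percent(prefix))
```

The translated prefix matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.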