Gene Information |
Locus tag | Haur_2806 |
Symbol | |
ID | 5734687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3566095 |
End bp | 3568725 |
Gene Length | 2631 bp |
Protein Length | 876 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279949 |
Product | peptidase M16 domain-containing protein |
Protein accession | YP_001545572 |
Protein GI | 159899325 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
| |
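The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below uses only values copied from the table. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. The record itself states neither convention, so treat both as assumptions.

```python
# Values copied from the Gene Information table.
start_bp = 3566095
end_bp = 3568725
gene_length_bp = 2631
protein_length_aa = 876

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("gene length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the table: the span is 2631 bp, which is 877 codons, or 876 amino acids plus one stop codon.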
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.326138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACTTG CGACCAATCT TCATCGATTA CCCAACGGCT TGACGGTGCT CACCCGCGAG GTGCATACCG CGCCAATCGC GACCAACTGG ATTTGGTATA AAGTTGGTGG GCGCAACGAA CGGGTGGGCA TTTCGGGCAT TTCGCACTGG TGTGAGCATA TGTTGTTCAA AGGTACACCA TCGATGCCTA AAGGCGCGTT TGATGCCACG ATTGCCCGCA ATGGCGGTAC ATTCAATGGC TTTACTTGGA TCGATTACAC CGCCTATTAT GAAACCCTGC CCGCCGATCG CCTCAGCTTG GGGTTGCAAA TCGAAGCAGA TCGCATGGTC AATTCATCGT TTGATCCTGA TGAAGTCGCT TCTGAACGCA CGGTCATTAT TTCGGAACGC GAGGGCAACG AAAACAGCCC TAGTTTTTGG CTTGATGAAG AGTTACGTTC AACCGCCTTC AAAGTCCATC CTTATCGTAA TGGGGTGATT GGCTGGAAGA GCGACTTGCG AGCCATGACC CGCGAAGATT TATACACCCA CTACAAGACT TTTTACGCAC CCAACAACGC GGTGTTGGTG GTCGTCGGGG CATTCAATAC CGATGAAGTG CTCAAACAAA TCGAAGCCTT GTATGGCCCG ATTCCCCAAG GCTTGCCCTT GCCCGAAGTG CGCGGCGAGG AGCCAGAACA GCAGGGCGAA CGCCGCGTCC ATGTTTCGCG GCCTGGCCCC AACTCAATGA TTCAAATTGC CTTCCATGCC CCACCAGCCA CCTCACCTGA TTGGGCCGCC TTGACCGTAC TCGATGCAGT GTTGACGGGT GGTAAATCGC CTTCGTTTAC AGGCGGCGGT GCTCAGACCA ATCGCTCAGC TCGTTTGTAT CGGGCCTTGG TTGAGGGCGA ATTGGCCACT GGCGCTTATT CATCGTTTAT GGCGACGCTT GATCCATTCT TATTTGAAAT TGGGGCAACC GTGCGGCCTG ATCGCACAGT CGAGCAAGTT GAGCAAGCAC TCTACACTGA AATCGAAAAA CTGCAACAAA CACCAATCAG CGAGGCTGAA TTGCAAAAAA TTCAGCGCCA GGTGCGTGCC CAACAAGCCT ATAGCCTCGA ACGAATCAGC AATCAGGCCA TGATGTTGGG CATGTGGCAA ACCCTCGATT CCTATGAACG GGCCGATAGT TCGTTGGAAG AGATTGCAGC GGTGACTGCC GCCGATGTTC AACGAGTGGC CCAGCACTAT CTAGCCCCAC AAAAGCGCAA TATCGGGGTC TTTACGCCGA GCAATAGCAG TGGCGGAGGC CAAATCACCC CAACTGCCCG TACATTTCAC CCAGCCTTGA GTTTTTATGA TGCCAAGGCG GCTGAAGCTG AAGCTACCGC TGCCGCCAAC AATGCTCCCC AAGCAACCCG CCATGTGCTG AGCAATGGCA TTGTGGTGTT ATTGCAGCGC AATCCAAATA GCCCAACTGT TAGCATTCAG GGCGAAATCG CGCTGGGCCA AATCCATGAA TCAAGCGCAT TAAATGGCGT GGCCGTGTTC ACCGCCGCCG CACTCACTCG TGGCACAACC AGCCGCAGTT TCCACGATAT TACCAATCTT ACCGAAGATC GTGGCTGCTC AATTTCGGCT AGCGCAGGCC GCCACAGCAC TTCGTTTGGG GGCAAAGCCT TGAGCGATGA TGCGCCATTA ATCTTAGAAT TATTGGCTGA TGTGTTGCGC AACCCAACCT TCCCTGAGCG TGAAATCGAG CGTTTGCGCA CCCAATTTAC CACCATGCTG CGCCAAAGTG AGCAAGATAC CCGTTCGCAA 
GCATCCAAAG CTGCGCGTGA ACAACTGTAT CCCAGCGATC ATCCTTACTA TTTTTCACCC AACGGCTCGT TAGATACCGT GCCTGGGATT ACCACTGCGG ATCTTGCCGC CTTTGCCAAG CGCTATCACC CTGCCGCGAC CACGATTGCA ATTGTTGGCG ACATTGACGA AACGGCGATT TTGGCCGAAG TTGAGCGCTG GTTTGGCGAT TGGCAAGGTC AAGGCGAGCC ACCAACGACT GCCGTACCAA GCGTTGATCT GCCCCCTAGC GTGCTGCGCC GCGAAATTGA AGTCGCAGGC AAAACCCAAT CCGACCTCGT TTGGGCCGTA CCAGGTTTAG CCCGCACCGA CCCTGATTTC TATGCAGCGA TGATGGCTAA TTTGGTGCTT GGCCAATTGG GCTTGATGGG GCGTTTAGGC GAAAATGTAC GTGATAAACA AGGCCTCGCC TATTATGCCA CCAGCCGCAT CGATGCCGAT GTTGGGGCTG GCGCGTGGAT CATCTATGCT GGGATCAACG CCAAAAATGT TGATCGGGCA CTCAGCGCCA TCCAAGAGGA AGTTGATCGC TTGTTGGCTG AAGGGATTAG CGAACTCGAA CGCAGCGATT CGGTGGCCTA TCTCACAGGG ATGTTGGGGA TTAGTCTTGA GGCCAATAGT GGCATCGCCA ATATGTTGCT GAATATCGAA CGCTACAACT TAGGCCTCGA TTATGTACAG CGCTATCCCG AAATTATCGG TTCGGTTACG CTTGAGCAGA TTCATGCTGC CGCCAAACGC TTGCTTTCCA GCGAGCGCTA TGTAATTGGA GTAGCAGGAC CAGCCGCCTA A
|
Protein sequence | MTLATNLHRL PNGLTVLTRE VHTAPIATNW IWYKVGGRNE RVGISGISHW CEHMLFKGTP SMPKGAFDAT IARNGGTFNG FTWIDYTAYY ETLPADRLSL GLQIEADRMV NSSFDPDEVA SERTVIISER EGNENSPSFW LDEELRSTAF KVHPYRNGVI GWKSDLRAMT REDLYTHYKT FYAPNNAVLV VVGAFNTDEV LKQIEALYGP IPQGLPLPEV RGEEPEQQGE RRVHVSRPGP NSMIQIAFHA PPATSPDWAA LTVLDAVLTG GKSPSFTGGG AQTNRSARLY RALVEGELAT GAYSSFMATL DPFLFEIGAT VRPDRTVEQV EQALYTEIEK LQQTPISEAE LQKIQRQVRA QQAYSLERIS NQAMMLGMWQ TLDSYERADS SLEEIAAVTA ADVQRVAQHY LAPQKRNIGV FTPSNSSGGG QITPTARTFH PALSFYDAKA AEAEATAAAN NAPQATRHVL SNGIVVLLQR NPNSPTVSIQ GEIALGQIHE SSALNGVAVF TAAALTRGTT SRSFHDITNL TEDRGCSISA SAGRHSTSFG GKALSDDAPL ILELLADVLR NPTFPEREIE RLRTQFTTML RQSEQDTRSQ ASKAAREQLY PSDHPYYFSP NGSLDTVPGI TTADLAAFAK RYHPAATTIA IVGDIDETAI LAEVERWFGD WQGQGEPPTT AVPSVDLPPS VLRREIEVAG KTQSDLVWAV PGLARTDPDF YAAMMANLVL GQLGLMGRLG ENVRDKQGLA YYATSRIDAD VGAGAWIIYA GINAKNVDRA LSAIQEEVDR LLAEGISELE RSDSVAYLTG MLGISLEANS GIANMLLNIE RYNLGLDYVQ RYPEIIGSVT LEQIHAAAKR LLSSERYVIG VAGPAA
|
| |
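As a quick consistency check between the two sequences, the first 30 bases of the gene sequence can be translated and compared with the first 10 residues of the protein sequence. This is a minimal sketch: the `translate` helper and the partial `CODONS` dictionary are illustrative names, not part of the record, and the dictionary covers only the codons that occur in this prefix. For these codons the assignments in translation table 11 are the same as in the standard genetic code.

```python
# First 30 bases of the gene sequence, copied from the record with spaces removed.
gene_prefix = "ATGACACTTGCGACCAATCTTCATCGATTA"
# First 10 residues of the protein sequence from the record.
protein_prefix = "MTLATNLHRL"

# Partial codon table (hypothetical helper): only the codons present in gene_prefix.
CODONS = {
    "ATG": "M", "ACA": "T", "CTT": "L", "GCG": "A", "ACC": "T",
    "AAT": "N", "CAT": "H", "CGA": "R", "TTA": "L",
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

print(translate(gene_prefix) == protein_prefix)
```

A full-length check would need a complete codon table, for example from an established bioinformatics library, rather than this hand-written subset.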