Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1183 |
Symbol | |
ID | 5733076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1358498 |
End bp | 1361683 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278323 |
Product | peptidase M14 carboxypeptidase A |
Protein accession | YP_001543959 |
Protein GI | 159897712 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCGTC TGAGGCATCG TTGGTCGTTA ATTGGGACAA TCCTAGCGTT AATTGGCTTG TGGAGTAGCT TGGTGGTGAT TTCGCTGCCC CAACGAACCC AAGCCCAGCC TGTGGTCGAG GAACAACGGA TTGTGGCGCG GATTGAGGCC AAAGATCGCG CTGATTCACT GGCACTTAGT GCGCGAGGCC TCGATTTGTT GGAAATGCGC GATAAGCACG ATTTGTTTGC ATTGATTACG CCAAGCGAAT TGGCTAAATT GCAACAAGAA GGCTTTGTCG CTGAAATTGA TCAAGAGCAA ACTCGTTTGT TGCAAGAACC TTCAATCATG CCAGTCCAAG GTGGATTCCG CACGGTTGAA GAAGGCTATG CCTTGCTTGA TCAATGGCAT GCAACCTATC CCAACCTAAC GGATTTGTTC ACCTATGGGA CTTCATGGGA TAAAGTGACC GCTGGTGGGC CAGCAGGCTA CGATTTGCGT GGGATCACAC TGACCAATTC GTTGATTCCT GGGCCAAAAC CAACCTTCTT CTTAATGTCG GCGATTCATG CCCGTGAAAT GTCAACTGCT GAATTGACCT TGCGCTATAC CGAGTATTTG CTTTCGCGCT ATGAAACCGA CCCCGATGTG CATTGGTTGC TTGATGAACA CACAATTGTG ATTGTGCCTT TTGTCAACCC CGATGGCCGC AAGATTGCCG AGCAAAGCTT ATCGCAACGC AAAAATCGCA ACACGGTTGA TACTTCAAGT TGTAGCGGCG TGAATATTGG GATCGACCTC AACCGCAACT CATCGTTCCA CTGGGGCGAA GTTGATAGCC CGAATGGTGA TCGTTGTGGC GCAACATGGC CTGGCGTTTC AGCTGCTTCA GAGCCAGAAG TTGCCACCTT ACAACAATGG ATTCGTGGCG TATTTGCCGA TCAACGTGGG CCAAGTGATA CTGATCCTGC GCCAGATACC ACAACCGGGG TCTATATCTC AATTCACTCA TATAGCGATT TGGTCTTGTG GCCATATGGT CACTCAGCCC AACTTGCGCC AAACGATGCC GATTTGCGTG GTTTGGGCAA GAAATTCGCC AGCTACAACG GCTACACACC GCAAAAATCC GACGAACTGT ATCCAACCAG TGGTACAACC GACGATTGGG CCTATGGTGA GTTAGGGGTA GCGGCCTATA CCTTTGAAAT TGGGCCAGAA TCAGGCACAT GTAGCGGCTT CTTCCCAGCA TTTACCTGTT TGGATGGCCA AGCTCCTGGT AATTTCTGGG GTCGCAACTT GCCTGCCTTC TTGTATGCCT CGAAAGTTGC CCGTACACCA TATTTGTTGC AACGTGGTCC CGATGCTTTG AATGTAACCG CTCAATCGAT GAGCAATGGT TACAAATTGC TGGCAACGAT TAATGATGTA ACCAATGGCA ACCAAACAAT TGCTGCTGCC GAAGCCTATG TTGATACACC ACCATGGCGA GCTGGGGCAA CTGCAATTAG CCTGAGCGCA ACCGATGGCA GTTTCAACAG CACCCAAGAA GCGGTCAATG CAACCATTCC GCAAACCTTG AATGCTGGTC GCCACTTAGT CTATTTCCGT GGGCGCGATG CTGCGGGCAA CTGGGGGCCG GTTAGCGCTC AATGGCTCGA TGTTGCACCG CAAGGCTTGG TTGGGTTTGT CCGCGCAAGC GATAACAATC AGCCAATTGC CAATGCAACG GTCGTCGCCA CAACTGGCAC GTTTACCAGC ACGACGACCA GCGGCGCTGA TGGCAGTTAT CGTTTGGAAT TGCCAGTTGG TAGCTACACG CTCAAGGCCA GTGGCACAGG CTTGACTCCT GCTAGCTACA ACCTGACTGT TAGCAGCAAT AGTTTCACAA CCCAAGATAT TAGTTTGGCG CAGTTGGCAG TCTTGACGAC CTCGCCCAGT CCATTGACCT TCAACGTGGC CAGTGGCAGC CAAGATCGCA CGTTGGTGGT GGGCAATGCT GGTGGTACAA GCTTGAATGC AGCCATCTCA CTCGCTCCAA CTGGCTATGA AGTTAAGAGC AGTGACGATG CTGGTGGCCC AAGCTATACT TGGAACGACA TTAGTAGCAC AGGTACACGC CTCAGTTTGG GCGATGATAC CTGTTCGGTG GTGAACTTGC CAAGCAGCTT CAATTACTAT GGCACCGCCT ATAGCAAATT GATTGTCAAC AGCAATGGTT TTGTTAGCCC AACCAATGCC ACTACCTGTA GCTCAACTGG TACATCGACC AACGGCGTTG TGCCAAGCAC GTCAACGCCC AACAATGTGA TTGCAGCCTT GTGGGACGAC CTTGATCCTG AAGGTTTGAC GGGCACGAAC GGGGTCTTTA CCTATAACGA TAGTGCCAAC AATCAATTTA TTGTTGAATT TAGTGGTGTG CCACACTGGG CAAACAATGG CAACTTCAGC CCCGAAGATT TCCAATTTGT GCTGAATCTG ACAACTGGCG ATGTTACGCT CAATTATCAA AACATTGATA CCCAAAATAG CGTCAGTGTT GGTATCGAAG ATAGCACTGG GGCCAATGGC TATCAGTGGG TCTATAACAG CACTGGCCGT TTACACGATA ATTTGGCAAT TCAATTTGCG GCCTATGCTG GCAGTGCCCC ATGGCTCAGT TGGACACCTA GCAACATTGA TGTAGCAGCG CGTGGTTCGA CGAACGTCCA AGTAACCGCC AATGCTACAG GCCTTGCCAA TGGCACCTAT CGCACTCGCT TACGAGTTAA CGCAGGAGCC AACACGATCA ACGGCGACCA AACGGTTCCA GTGGTTTTGA ATGTTGGTAG CTCGACAGTC CATGATGTTG CAGTTAGTGC ACCGCAAGCA GCCTTGAGTG GGTTCGTTGG TTCAACCATC ACCTACACGC TGAGCGTCAC TAACACAGGC AATGTCAGCG ATAGCTTTAA CTTGAGCTTG AGTGGCAATG TTTGGCCAAC AACCTTGAGC CAAACCAGTG TTAATTTGGC GGCTGGTGCA AGCACTACAA TTCAAGTGTC GGTGGCAATT CCGGCCAATG CTGCCGCCAA TAGCACGGAT AGCGTGACTA TTACGGCAAC GTCGGCTGCC GATAGCAGTG CAACCAATAG CATCAGCTTG GTTTCAACCG CTAATAGCAT TCCAGTCAGC CAATATAAGG TATTTATGCC CTATATTGTG AAGTAG
|
Protein sequence | MDRLRHRWSL IGTILALIGL WSSLVVISLP QRTQAQPVVE EQRIVARIEA KDRADSLALS ARGLDLLEMR DKHDLFALIT PSELAKLQQE GFVAEIDQEQ TRLLQEPSIM PVQGGFRTVE EGYALLDQWH ATYPNLTDLF TYGTSWDKVT AGGPAGYDLR GITLTNSLIP GPKPTFFLMS AIHAREMSTA ELTLRYTEYL LSRYETDPDV HWLLDEHTIV IVPFVNPDGR KIAEQSLSQR KNRNTVDTSS CSGVNIGIDL NRNSSFHWGE VDSPNGDRCG ATWPGVSAAS EPEVATLQQW IRGVFADQRG PSDTDPAPDT TTGVYISIHS YSDLVLWPYG HSAQLAPNDA DLRGLGKKFA SYNGYTPQKS DELYPTSGTT DDWAYGELGV AAYTFEIGPE SGTCSGFFPA FTCLDGQAPG NFWGRNLPAF LYASKVARTP YLLQRGPDAL NVTAQSMSNG YKLLATINDV TNGNQTIAAA EAYVDTPPWR AGATAISLSA TDGSFNSTQE AVNATIPQTL NAGRHLVYFR GRDAAGNWGP VSAQWLDVAP QGLVGFVRAS DNNQPIANAT VVATTGTFTS TTTSGADGSY RLELPVGSYT LKASGTGLTP ASYNLTVSSN SFTTQDISLA QLAVLTTSPS PLTFNVASGS QDRTLVVGNA GGTSLNAAIS LAPTGYEVKS SDDAGGPSYT WNDISSTGTR LSLGDDTCSV VNLPSSFNYY GTAYSKLIVN SNGFVSPTNA TTCSSTGTST NGVVPSTSTP NNVIAALWDD LDPEGLTGTN GVFTYNDSAN NQFIVEFSGV PHWANNGNFS PEDFQFVLNL TTGDVTLNYQ NIDTQNSVSV GIEDSTGANG YQWVYNSTGR LHDNLAIQFA AYAGSAPWLS WTPSNIDVAA RGSTNVQVTA NATGLANGTY RTRLRVNAGA NTINGDQTVP VVLNVGSSTV HDVAVSAPQA ALSGFVGSTI TYTLSVTNTG NVSDSFNLSL SGNVWPTTLS QTSVNLAAGA STTIQVSVAI PANAAANSTD SVTITATSAA DSSATNSISL VSTANSIPVS QYKVFMPYIV K
|
| |