Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2911 |
Symbol | |
ID | 5734782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3683244 |
End bp | 3684737 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280054 |
Product | carboxypeptidase Taq |
Protein accession | YP_001545677 |
Protein GI | 159899430 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2317] Zn-dependent carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000970624 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAAG CTCTTGAGCA ACTTCGCGAC CATCTCGCAA CTATCCGCGA TCTTAATCAC ACCGCAGGAT TGCTTGCCTG GGATCAGCGC ACTCAAATGC CGCGTGGTGG GGCCAGCACG CGTGCTGAGC AATTAGCTAC CATCAGTCGA ATTTCGCATG AGCTATTCAC TGGCGAGAAA ACCCTCAGCT TGCTTGATGC GGTTGATGTT GCTGATTTAT CGCCCGACTC TGATGATGCT CGTTTGATCA GTCATACCCG CTACAATTAC GAACGTTCGA GCAAATTGCC TGCCGAATTT GTCGCCAAAC AATCGCGGGT ACGGGCATTA GCGGGCAAAG TTTGGGAAGA TGCCCGTGCC AACGCCGATT TCAGCCAGTT TCAGCCGCAT CTTGAAACGA TTGTTGAGAT GGTGCGCGAA CAAACCCAAT TTCTGGGCTA CGATGAACAT CCCTACGATG CCTTGCTCAA TAGCTACGAG CGTGGTTTGA CAACCAAGCA AGCGCTTGGC TTGTTCGATG AATTGAAGGC CGGCACTGTG CCATTGGTGC TCAAAATTGC CGCGATGGGC GACGATGGCC GCGATGCCCC GTTGCATGGC AATTATCCCG AAGCGCTCCA AGAACAATTT GGCAAAAAAG TGACCGTGCG TTATGGCTAC GATTGGAATC GCGGTCGGCA AGATCGCAGC ACCCATCCAT TTTGTAGCAA CTTTGGGCGC AACGACGTAC GGATCACCAC GCGTTTTAAT CCCAATTGGC TTTCACCAGC GCTTTTTGGC ACATTGCATG AAACCGGGCA TGCCTTGTAC GAACAAAATA TCAAGCCCGA ACTTGATCGC ACGCCGCTTG GTCGCGGTAC CTCGTTGGGT GTCCACGAAT CGCAATCGCG TATGTGGGAA AATATTGTTG GGCGTTCGCG ACCATTCTGG GAGTTTTTCT ACAGCGATTT GCAGGCGACC TTCCCTGAGC CATTAGCCAA TGTCGATCTG GAGCGCTTCT ATCGGGCCGT TAATGCGGTT AAGCCCTCAT TGATTCGCGT CGAAGCTGAT GAATTAACCT ACAATTTGCA TATTTTGCTG CGTATGGAGC TAGAAATTGC CTTGGTTGAA GGCACGCTCA AGGTTGCCGA TTTGCCCGAA GCTTGGAACG CCAAGATGCA AGCCTACCTT GGCATCACCC CAGCCAACGA TGCTGAGGGT GTACTCCAAG ATATTCACTG GTCGGGCATG ATGTTTGGCT ACTTCCCCAC CTACACTATT GGCAATGTGC TGTCGGTTCA ACTATTTGAT ACGGCAATCG CCCAACATCC TGAGATTTGG GATGAAATGC GCCGTGGTGA ATTTGGCACA CTGCTTGGTT GGATGCGTGA ACATATCCAT CAGCATGGCA GTAAATTCTT GCCTAACGAG TTGATTACCC GAGCAACGGG CCGGTCAATG GATGCAGCGC CTTACGTTAA GTATTTACAA ACCAAATTTG CTGAATTATA TTAA
|
Protein sequence | MSQALEQLRD HLATIRDLNH TAGLLAWDQR TQMPRGGAST RAEQLATISR ISHELFTGEK TLSLLDAVDV ADLSPDSDDA RLISHTRYNY ERSSKLPAEF VAKQSRVRAL AGKVWEDARA NADFSQFQPH LETIVEMVRE QTQFLGYDEH PYDALLNSYE RGLTTKQALG LFDELKAGTV PLVLKIAAMG DDGRDAPLHG NYPEALQEQF GKKVTVRYGY DWNRGRQDRS THPFCSNFGR NDVRITTRFN PNWLSPALFG TLHETGHALY EQNIKPELDR TPLGRGTSLG VHESQSRMWE NIVGRSRPFW EFFYSDLQAT FPEPLANVDL ERFYRAVNAV KPSLIRVEAD ELTYNLHILL RMELEIALVE GTLKVADLPE AWNAKMQAYL GITPANDAEG VLQDIHWSGM MFGYFPTYTI GNVLSVQLFD TAIAQHPEIW DEMRRGEFGT LLGWMREHIH QHGSKFLPNE LITRATGRSM DAAPYVKYLQ TKFAELY
|
| |