Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0115 |
Symbol | |
ID | 5732008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 147983 |
End bp | 149488 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277237 |
Product | carboxypeptidase Taq |
Protein accession | YP_001542895 |
Protein GI | 159896648 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2317] Zn-dependent carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0714794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAAC AATTAGCTGA TTTGAAGGAA AAAATTGGGG TTGCTCAAGA TTTATCGTAT GCTAGCGCTG TGTTAAATTG GGATCAAAGC ACCTACATGC CCGCTGGCGG GGCCGAAGCT CGTGGTCGCC AAATGGCCAC GCTCAGCCGC TTAGCCCACG AGCATTCCAC CGCGCCCGAA GTTGGTCGCT TGCTTGAACA GCTCGTGCCG TGGGCCGAAC AACAAGATCC TGAATCGGAC GACGCAGCCT TGGTCTTGGT GACCAAGCGC GATTATGACC AAGCGGTGCG CGTGCCCAGC GAGTTTATTG CTGAATTGTA TTCGCACGTC GCCAAAACCT ATACCGCCTG GACCCAAGCT CGCCCCAACA ACGACTTCGC TTCAGTCGCC CCATTACTCG AAAAAACCCT TGACCTCAGC CGCCGCTATG CTGAATTTTT CGGCCCCAGC GAACATATTG CCGATCCACT GATCGATATG GCTGATCAAG GCATGACGGT GGCAAAAATT CGTGAGATTT TTGGACCCTT GCGCGAACAA CTCGTGCCAT TGGTCAAAAC CATCACCGAG CAACCTGCCG CTGACGATAG CTGTTTGTTG CAACACTACC CCGAAGCTGA GCAATTGGCC TTTGGTGAAA GCCTCATTCG CGAAATTGGC TACGACTTCG AGCGCGGTCG CCAAGATAAA ACCCACCATC CCTTCATGAC CAAGTTCTCG ATTGGTGATG TGCGGATCAC CACCCGCTTC CGTGAAAACG ATTTGAGCGA TGGCCTGTTT AGCACAATTC ACGAAACTGG CCATGCCGTC TATGAGTTGG GCGTAAATCC AGCCTACGAA AATACCCCAT TGGCAAGCGG AGCTTCGGCA GGAACCCACG AATCACAATC ACGCTTGTGG GAAAATGTGG TTGGTCGCAG CCGCGCCTTC TGGCAATATG CCTATCCCAA AGCTCAGGCC GCTTTCCCCA ATCAACTGGG CAAGGTCGAT TTAGATACCT TCTATCGGGC GATTAACAAA GTGCAGCGCT CGTTGATTCG CACCGATTCA GACGAAGTGA CCTACAACTT GCACGTTATG ATTCGCTTCG ATTTGGAATT GGCCTTGCTC GAAGGTAAAT TGGCAATTCG CGATTTGCCT GAAGCTTGGC ACGAACGCTA TCGCAGCGAT TTGGGCATTA CCGCACCCGA TAATCGCGAT GGTGTGCTGC AAGATGTCCA CTGGTATGGT GGGATTATCG GTGGCTCGTT CCAAGGCTAC ACGTTGGGCA ATATCCTCAG CGCTCAAGTT TTTGATGCTG CTGTGCGAGC CAATCCCAAT ATACCAACTG AAATTAGCCA AGGCCAATTT GCCAACCTGC ACAATTGGCT GAAATCGAAT ATGTACGTTC ATGGCCGCAA ATATAGCGTA CCAACCTTGA TCCGCAAGGT TACAGGCCAA GACCTCAGCA TTGAGCCGTA TATTCGCTAT CTCCGCACCA AATACGGCGA ACTCTATTCG TTGTAA
|
Protein sequence | MQEQLADLKE KIGVAQDLSY ASAVLNWDQS TYMPAGGAEA RGRQMATLSR LAHEHSTAPE VGRLLEQLVP WAEQQDPESD DAALVLVTKR DYDQAVRVPS EFIAELYSHV AKTYTAWTQA RPNNDFASVA PLLEKTLDLS RRYAEFFGPS EHIADPLIDM ADQGMTVAKI REIFGPLREQ LVPLVKTITE QPAADDSCLL QHYPEAEQLA FGESLIREIG YDFERGRQDK THHPFMTKFS IGDVRITTRF RENDLSDGLF STIHETGHAV YELGVNPAYE NTPLASGASA GTHESQSRLW ENVVGRSRAF WQYAYPKAQA AFPNQLGKVD LDTFYRAINK VQRSLIRTDS DEVTYNLHVM IRFDLELALL EGKLAIRDLP EAWHERYRSD LGITAPDNRD GVLQDVHWYG GIIGGSFQGY TLGNILSAQV FDAAVRANPN IPTEISQGQF ANLHNWLKSN MYVHGRKYSV PTLIRKVTGQ DLSIEPYIRY LRTKYGELYS L
|
| |