Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2642 |
Symbol | |
ID | 5734522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3389943 |
End bp | 3391043 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641279784 |
Product | peptidase M29 aminopeptidase II |
Protein accession | YP_001545408 |
Protein GI | 159899161 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2309] Leucyl aminopeptidase (aminopeptidase T) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTTGATC AACGCTGGCA ACAGTTGGCC AAAATCATTG TCCATCACTC ATTAGAGCTA CAACCAAACG ATTTGCTACG AATTCAAGCT GAAGCTATTG CGAAGCCATT GCTCTATGCG CTCTACCGCG AGGCACTCCA TGCTGGAGCC TTAGTTATTC CCAAAATTGT CGATCCGGTA TTTGAAGAAA TTATGCTCAA AGAGGGCACT CCTGAGCAGC TGCAATTTGT GCCAAGCACC TTAGTTCACG AAATTGAAAC CATGACTACT TGGTGTGATA TTTATAGCGA AATAAATACG AAACATTTCA ACCAAGCCGA TCAACAACGC CAACTATTGC GCCGAAAAGC ATTCGGCCCA GTCCAAGTAT TATTCGATAG CAGAGCAGCT CAAAATCAAT TGCGCTGGTG CGATGTGCTT TATCCAACCG AGGCTTTTGC TCAAGACGCA GGTATGTCGC TGTGGGATTT TGAAGACTTG GTAGTAAAAT CCTATCTGCT TGATCATCCA GATCCAGTTA CAGCGTGGCA GACCATCCAT CAACAACAGC AAAAAGTTAC CCACTTCCTC AATAGTTGTC GCTCAATTCG GATTGAAGGG CCAGACGTTG ATTTGAGTTA TCGCTGTGAA GATCGCATTT GGATTAATTG TGCTGGTAAA CGCAATCTGC CCGATGGCGA AGTCTTTACC GCGCCAATCG AAGATTCAGT CAATGGCCGA TTGAAGATTA GCTATCCAAG CATTTATCAG GGGAATTTGG TTAGCGGAAT TCAGCTCGTG ATTGAAGATG GCAAAGTAAC CCAAGCAACT GCTGAGCAAG GCCAAGATTT TCTGCATACC ATGCTTGATC TTGATGCTGG TGCTCGGTAT ATTGGCGAGG TTGCCTTCGG CCTGAACCCA GGCATTACAA AACCAACTGG TCATACTATT TTCGATGAAA AGATGGCTGG AACGATGCAC TTAGCACTTG GTCGAGCCTA TCCTGAGTGT GGCGGCAAAA ATGAATCAAC CCTGCACTGG GATTTAGTCT GCGATTTACA TCAAGCTGAA GTGTATGCCA ATAATGCGCT GTGCTACAAA AATGGTGAGT TTATTATTTA A
|
Protein sequence | MFDQRWQQLA KIIVHHSLEL QPNDLLRIQA EAIAKPLLYA LYREALHAGA LVIPKIVDPV FEEIMLKEGT PEQLQFVPST LVHEIETMTT WCDIYSEINT KHFNQADQQR QLLRRKAFGP VQVLFDSRAA QNQLRWCDVL YPTEAFAQDA GMSLWDFEDL VVKSYLLDHP DPVTAWQTIH QQQQKVTHFL NSCRSIRIEG PDVDLSYRCE DRIWINCAGK RNLPDGEVFT APIEDSVNGR LKISYPSIYQ GNLVSGIQLV IEDGKVTQAT AEQGQDFLHT MLDLDAGARY IGEVAFGLNP GITKPTGHTI FDEKMAGTMH LALGRAYPEC GGKNESTLHW DLVCDLHQAE VYANNALCYK NGEFII
|
| |