Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3114 |
Symbol | |
ID | 5734986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3928331 |
End bp | 3929434 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280258 |
Product | peptidase M29 aminopeptidase II |
Protein accession | YP_001545880 |
Protein GI | 159899633 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2309] Leucyl aminopeptidase (aminopeptidase T) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.185719 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGATC CACGGGTTCT CAAAATGGCT CATACGTTGG TCAATTATTC GATGAAAATC AAAGAGGGCG AATTGGTGGT TTTGCAGAGT GAGCCTGCTG CTGCGCCCTT AGTTCAAGCA ATGTATCGCG AAATTTTGCT GGCTGGCGGG CATCCAGTTG CCCATACGGT TATGCCTGGG CTTTCGCGAA TTTTGCTCAA TCATGGCAAC GATAAGCAAT TGCAGTGGAT CTCGCCCTAC GATCGTTTGG GGATTGAAAC TGCTGATGTG CGGATTCGGA TCGATGCTCA AAGCAACACC CGCGAACTCT CGCAGGTTGA CCCTGAGCGC CAATCGGTGT TTCAAAAATC ACGCCGCGAA TTGATGGGCA CGTTGATGTC ACGCACCCAT GCTGGCGATT TTCGCTGGTG TGTCACGCTC TTTCCAACCG AGGGCTTGGC TCAAGATGCC AATATGAGCT TGCCTGATTT TGAGGATTTT GTGTATGGCG TGTGCTTTTT GAACGAAGCC GACCCAATCG CCAAATGGCA AGAACTTCAC GATATGCAAG CACATTTGAT CAATTACTTG CAAAATAAGC GCGAAGTGCA TATCCTAGGT GAAGGCACCG ATATTCGGGT TGGGATTGCT GGGCGCAGCT TTGTCAACTG TGCAGGCGAT GCCAACTTCC CTGATGGCGA GTTCTTTACT GGCCCCGAAG AAACTAACGT CAATGGTGTT GTACGCTTTT CGTTCCCAGC GATCTATAAT GGACGCGAAG TCGAAGACGT GCAATTAACC TTCGAAGCTG GTAAAGTTGT CAAGGCAACC GCTAAGAGTG GCCAAGATTT CCTTGAGCAA ATGTTGAATG TTGATGCTGG CGCACGGATT TTGGGCGAAT TTGCCTTTGG CACCAACCCC AACATCAAAA ATTACACCCG TAACATCTTA TTCGATGAGA AAATGGGTGG TACAATCCAT ATGGCAGTCG GCGCTTCCTA CCCTGAAACT GGCGGTTTGA ATCAATCGGC AATCCACTGG GACATGGTTT GCGATTTACG TCAAAACAGC GAAGTGTATG TCGATGGTCA ATTGTTCCAA AAAAATGGCA CATTCGTGGT CTAA
|
Protein sequence | MADPRVLKMA HTLVNYSMKI KEGELVVLQS EPAAAPLVQA MYREILLAGG HPVAHTVMPG LSRILLNHGN DKQLQWISPY DRLGIETADV RIRIDAQSNT RELSQVDPER QSVFQKSRRE LMGTLMSRTH AGDFRWCVTL FPTEGLAQDA NMSLPDFEDF VYGVCFLNEA DPIAKWQELH DMQAHLINYL QNKREVHILG EGTDIRVGIA GRSFVNCAGD ANFPDGEFFT GPEETNVNGV VRFSFPAIYN GREVEDVQLT FEAGKVVKAT AKSGQDFLEQ MLNVDAGARI LGEFAFGTNP NIKNYTRNIL FDEKMGGTIH MAVGASYPET GGLNQSAIHW DMVCDLRQNS EVYVDGQLFQ KNGTFVV
|
| |