Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2751 |
Symbol | |
ID | 5734632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3508481 |
End bp | 3509581 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641279894 |
Product | peptidase M24 |
Protein accession | YP_001545517 |
Protein GI | 159899270 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGGCG TGCGAATTCG ACGTTTGGCT AGCGCTGCCC GCCCGCAAGG GATGGATTAT GTGGTGTTGA TGCCTGGGGC TAACTTACAA TATTTTACGG GCTTGACCTT GCATTTAAGT GAGCGTTTGG CCTTGGCGTT GATCGCTGCT GATGGTCAGA GCATCAATAT TGTGCTGCCA GCCTTGGAGC AACCGCGTGC TTTAGCCGAA TATAGCGGCG AAGTGGCGGT ACGTTGGTTT CCATGGAGCG ATGATGAAGG CCCAATGAAT GCTTTGCGCA ATGCAGCGGC AGGCCTGATT GGTCGCACAG TTGGCGTGGA ATATACGACG ATGCGGGTGC TAGAATTACG CGCTTTAGAA GAAGTTGCGG GCGTACATAG CATCGATGCC AGCGCCGCGA TCGCCAGTTT GCGCATGCAA AAGGGCGCTG ATGAAATTGC CCTGATGCGC GAAGCTGTGC GCATTGTTGA GGCTGGGCTT AAAACCGCAA TTGAGGCGCT TCATCCAGGC CGAACCGAGC GCGAAATTGC CCGCATTTGG GAAGAAGCGA TGCAACTTGA GGGTGGCGAA GGCCCATCAT TTGCGACGAT TGTGGCGAGT GGCCCAAATA GTGCTAATCC ACACCATACG ACGGGCGAGC GCCAAATCCA AACTGGCGAT TTGGTAATTT TGGATGGTGG GGCGTTGTAT CGCGGCTATT GCTCGGATAT TACCCGCACT GTTTGCGTTG GCGAGCCAAA CGAGCAACAA CGGATGCTCT ATGAAACCGT TTTGGCGGCC AATCGCGCTG CCTGTGCCGG AGCCAAACCA GGCATGAGCG GCGCACAGGT TGATCGGCTC GCACGGCAAG TGGTTGAGGA TGCCGAATTA GGCCGTTACT TCATCCATCG CACAGGCCAT GGCTTGGGTA TGGAAATTCA CGAGCCGCCC TATATCGCTA GCACCAACAC CGTTGCCCTG CCAATTGGCA CGGTTTTTAC GGTTGAGCCA GGCACCTATG TTGCTGGAAT TGGTGGCGTG CGGATTGAAG ATGATGTGCT GTTGACCCCC ACTGGCGCTG AATGTTTGAC CAACTTTCCA CGGGAGTTGA TTGTCAAATG A
|
Protein sequence | MSGVRIRRLA SAARPQGMDY VVLMPGANLQ YFTGLTLHLS ERLALALIAA DGQSINIVLP ALEQPRALAE YSGEVAVRWF PWSDDEGPMN ALRNAAAGLI GRTVGVEYTT MRVLELRALE EVAGVHSIDA SAAIASLRMQ KGADEIALMR EAVRIVEAGL KTAIEALHPG RTEREIARIW EEAMQLEGGE GPSFATIVAS GPNSANPHHT TGERQIQTGD LVILDGGALY RGYCSDITRT VCVGEPNEQQ RMLYETVLAA NRAACAGAKP GMSGAQVDRL ARQVVEDAEL GRYFIHRTGH GLGMEIHEPP YIASTNTVAL PIGTVFTVEP GTYVAGIGGV RIEDDVLLTP TGAECLTNFP RELIVK
|
| |