Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0590 |
Symbol | |
ID | 5732488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 679819 |
End bp | 681375 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277717 |
Product | peptidase M1 membrane alanine aminopeptidase |
Protein accession | YP_001543366 |
Protein GI | 159897119 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0535796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGTT GGCTTTATAT TGGTTTGGCT GTGCTTTTGA TTGGCTGTGG CCAAGCCTTG AAAGCGACCA GCAGCGAGGT TGCTCCAAGC ACTAAGCCTG CCGCCGCGCC AACGATTGAT TTAACTGCCC TGATCACATG GCAGCCAGTG CCAGCCTATA CCCCACAAAT TCCCACGCCA ACTAATATTC CAATTCCCAC GCCAATTCCC GATGCTTGGC CAAATGAGCA GGCTTTGGCG CTGCGCCCCG AATTTGCGGG CGATGCTACT CATTCAACCA TGCCACGATA TGCAATGCAG ATTACGCTTG AGCCAAATGT AGGTCGTTAT AGCGGCAGCC AAATTATTAC CTACACCAAT ACGACTGGCG CTAGTATCAA TAGTTTGGTG CTGCGTTCGT ATAATAATTT TCCGCCCGAT GCCCAAGGCG ATGGCGGCGA TACCACGCTT GAAGTTACTA AGGCTTGGAG CAATGGCAAC GAATTGAATC TTGCTCGTGA AGCCGAAAAT ACCGCTTTTC GGCTTAATCT CGCCACAAGC CTAGCACCCA ATCAGCAGGT TGTAATCAGC ACCCAATTTG CTGGCACAAT CAAGGCTTGG CCCGATGGCT CGTATCCCTT GCTTTCGGCC TATCCGATGC TAGCGGTTTG GGATGCAGCA GCCAACGATT GGCGCATGGA TGTAACGCGC TTTCCTGATC GGGTGTTTGC TGAAACTGCA CTCTATCGGG TGCAACTCAA GCTACCTGCC GAATATCAAG TCATTGGAGC GGGAACGCTG CTGGAGCAAA ATGCTGATCG CACCAGCCGC ACCTATGTTA GTGGGCCAGT GCGGGAGTGG GCCGCCGCGC TTGGTCAATT TTCGGTCAGT ACAAGCAGCA TTGATGGAAT TGACGTTAAT GCCTATGGCC CCGATTATCT TGATTTGGCG CGGGTGCGCG AGATGGCAAT TGGCGCTTTG ACTAGTTATC AAGCCAAAAT CGGCGCGTAT CCCTACCGCA AACTCGATCT GCATGTGATG CCATGGGATA GCGGCGGCGG CATCGAGTAT CCAGCTTACA CGATTATTTT GGTGAACGAT GGGATCAATC GTGATGCTGA TTATGTGGTG TTCCATGAAG TAGCCCATCA GTGGTGGTAT GGGTTGTTGG GCAACGATGT CTATCGTGAG GCTTGGCTGG ATGAGGCCAT GGCTAGCTAC TTGACCTATG TGGCAACTCA GGACGTGCTT GGTCAAGCTG CCGCCGATAG CTATTACAGC GGCGAAATCG AGCGTTTGGC GCTCAGTAAT CAAGCCAATG GCAATTGGCC AGCAGGCTTG GCGATCAATC AATATCCATC ATTTAATAGC TACTATCGCG CAGTCTATGG CAAAGGCGCG GCGATGTTGC ATCAGTTACG CATCAAGCTT GGCGATCAAA GCTTCTTCCA AGGTTTAGCC CAACTCAACG ACCAAAAGCG CTATCAGATT ATCACCCGTA GCGATTTCCA AAGCGTTATG GAGCAAAGCA GCGGCCAAGC CCTAGGCGAG TGGCTGGATG GCTGGCTCAA CTGGTGA
|
Protein sequence | MRRWLYIGLA VLLIGCGQAL KATSSEVAPS TKPAAAPTID LTALITWQPV PAYTPQIPTP TNIPIPTPIP DAWPNEQALA LRPEFAGDAT HSTMPRYAMQ ITLEPNVGRY SGSQIITYTN TTGASINSLV LRSYNNFPPD AQGDGGDTTL EVTKAWSNGN ELNLAREAEN TAFRLNLATS LAPNQQVVIS TQFAGTIKAW PDGSYPLLSA YPMLAVWDAA ANDWRMDVTR FPDRVFAETA LYRVQLKLPA EYQVIGAGTL LEQNADRTSR TYVSGPVREW AAALGQFSVS TSSIDGIDVN AYGPDYLDLA RVREMAIGAL TSYQAKIGAY PYRKLDLHVM PWDSGGGIEY PAYTIILVND GINRDADYVV FHEVAHQWWY GLLGNDVYRE AWLDEAMASY LTYVATQDVL GQAAADSYYS GEIERLALSN QANGNWPAGL AINQYPSFNS YYRAVYGKGA AMLHQLRIKL GDQSFFQGLA QLNDQKRYQI ITRSDFQSVM EQSSGQALGE WLDGWLNW
|
| |