Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1185 |
Symbol | |
ID | 4709244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1288520 |
End bp | 1289878 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639855658 |
Product | peptidase M24 |
Protein accession | YP_001002762 |
Protein GI | 121997975 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAGA CCATCGATTT CACCCAGCTT GCCTGCGAGC AGCGGGAGAC CCTGGCTCGG CGCATCGGTG AGAGCGCGGT GGTGGTGGTC CCAGCGGCGC GGGAACAGCC CCGCAACCGC GATGTGGACC ACCCCTTCCG CCAGGACAGC GACTTCCGCT ACCTCACCGC CTTCCCCGAA CCCGACGCGG TGGCGGTGCT CGCCCCCGGC CGGCCCGAGG GCGAGTATGT GCTGTTCGTC CGTGAGCGCG ACCCGGAGGC GGAACGGTGG GCCGGGGCGC GCACCGGCCC CGAGGCCGCC TGCCAGGCCT ACGGCGCCGA TCAGGCCTGG CCGCTGGGAG AGCTCGATCA GCGACTGCCC GACCTGCTCG TCGGCCGGGA ACGGATGATC GCGCCGCTGG GCCGCGACGA GCACTGGGAC CGCCAGCTCC TGCAGTGGCT GCAGGCCGGG CGGGCGCGAG CCCGGGGCCA GGCCGTCGCC CCGGACCGCA TCGAGCTGCT CGACCGCAAC ATCCACGAGC AGCGGCTGAT CAAGCGCCCC GCCGAGCTCG AAGCGATGCG CCGGGCAGCC GGCATCTCGG TGGCGGCGCA TCGGCGCGCC ATGCAGGCCG TCCAGTCGGG GATGCCCGAG TACGCGCTGG CTGCCGAGCT GCTCGGCATC TTCCACCGAC ACGGCGGCGA GGCCGCCTAT CCGAGCATCG TCGCTGGGGG CGCCAACGCC TGCGTACTTC ATTACGTCAC CCTGCGCAAC ACACTGCACG AGGGCGACCT GGTCCTCATT GACGCCGGCG CCGAGGTGGA CGGCTACGCC GCCGATATCA CCCGCACGTT CCCGGTCAGC GGGGTCTTCA GCGCCGAGCA GCGAGCCGTC TACGACGTGG TCCTCGAGGC CCAGGAGGCA GCCATCGGGC AAGTGTGCAG CGGCAACGAC TTCGACGCCT TCCACCGCAC CGCCACGCGC ATCCTCACCC AGGGCATGGT GGATCTCGGC TGGCTCCGGG GCGAGGTGGA CGGACTGATC GAGCAGGGCG CCCACCGGCG CTTCTTCCCC CACCGCACCG GTCACTGGCT GGGACTGGAC GTACACGACG TCGGCAGCTA TGCAGTAGAG GGAGCGTGGC GCGTCCTCCA GCCTGGCATG GTGGTGACCG TCGAGCCGGG GCTCTACTGC CCGCCGGGCA GCGAGGAGGT GGATCCACGC TGGCACGGGA TCGGCGTTCG CATCGAGGAC GACGTGGTTG TCGAGCGGGA GACCCCGCGC ATCCTCACCA GCGGGGTGCC GAAGACCCCC GAGGCCATTG AGGATCTGAT GGGCGCCGTG CGCGGCGCAG GCTACGAGGA AAGTGGAGAC TTCGACTGA
|
Protein sequence | MNETIDFTQL ACEQRETLAR RIGESAVVVV PAAREQPRNR DVDHPFRQDS DFRYLTAFPE PDAVAVLAPG RPEGEYVLFV RERDPEAERW AGARTGPEAA CQAYGADQAW PLGELDQRLP DLLVGRERMI APLGRDEHWD RQLLQWLQAG RARARGQAVA PDRIELLDRN IHEQRLIKRP AELEAMRRAA GISVAAHRRA MQAVQSGMPE YALAAELLGI FHRHGGEAAY PSIVAGGANA CVLHYVTLRN TLHEGDLVLI DAGAEVDGYA ADITRTFPVS GVFSAEQRAV YDVVLEAQEA AIGQVCSGND FDAFHRTATR ILTQGMVDLG WLRGEVDGLI EQGAHRRFFP HRTGHWLGLD VHDVGSYAVE GAWRVLQPGM VVTVEPGLYC PPGSEEVDPR WHGIGVRIED DVVVERETPR ILTSGVPKTP EAIEDLMGAV RGAGYEESGD FD
|
| |