Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0681 |
Symbol | |
ID | 4710368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 764071 |
End bp | 765570 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639855144 |
Product | leucyl aminopeptidase |
Protein accession | YP_001002265 |
Protein GI | 121997478 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.516579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCTCA AGACCAAAAG CGGGGATCCG GCGCGGCAGC GCACCGCCTG TGTCGTGGTG GGCGTCTACG AGCGCCGACG GATGAGCGAG GCGGCGCGGG CCGTCGACGC CGCCAGTGAC GGTTACCTGA GCCATCTGCT GCGCCGGGGC GACCTTGAAG GCGAGGCCGG GCAGACCCTG CTACTGCCCG ACTGCCCCGG GGTGCGCACC GATCGCGTAC TGCTCGTCGG CTGCGGGCGC GAGCGCGACT TCAACGAGCG CACCTACCGC AAGGCCGTCA CCGCGGCGGC CCGCGCGCTG GAGCAGGCGG GCACCGGCGA GGCGATCCTG TTCCTGCCCG AGCTACCCGT CCGCGGCCGC GACGTGGCCT GGCGCGTGGC CGCCACCGCC GAGATCCTCG AGACCACGCT CTACCGCTTC GACACCTACA AGAGCGACCC GCGCCCGCCG CGCCGCCCGC TGCGCCAGGC CACCCTGGCC GTCCCGCGGC GTGCCGACCT GCGCCGCGCC CAGCCGGCGC TCACCCTGGG CCAGGCCGCC GGCCGCGGCG CCAACTTCTC CCGCGACCTG GGCAACACCC CGGCCAACAT CTGCACCCCC GGCTACCTGG GGGAACAGGC CGAGGCCCTG GCCCAGCGCT TCGACGGCGT GCGCGCCGAG ATCCTCGGTC CGGCGGAACT CGAAGAGCAG GGCCTGGCGG CCCTGCTGGC CGTGGCCCGC GGCGCCGAGG CGCCGCCCCG GCTGGTGGTG CTGCACTACC GCGGCGCCGA CGACGACCAG GCCCCGGTGG CCCTGGTGGG CAAGGGCATC ACCTTCGACA GCGGCGGCAT CTCCATCAAG CCGTCGGCGA GCATGGACGA GATGAAGTAC GACATGTCCG GCGCCGCCGC GGTCTTCGGC GCCGTCCACG CCGCCGCCGA GGCGCAGCTG CCGCTGAACC TGGTGGCCGT CATCCCGGCC ACCGAGAACA TGCCCGATGG CCGCGCCACC CGCCCCGGGG ACATCATCGA CAGCCTCGAC GGGCAGCGCA TCGAGGTCCT CAACACCGAC GCCGAAGGCC GCCTGGTGCT GGCCGACGGC CTCGCCTACG CCCGCCGCCT GGAGCCGAGC GAGGTGGTCG ACGTAGCCAC CCTGACCGGC GCGGCCATCA TCGGCCTCGG CCACCACCGC CACGCGGTGA TGGGCAACGC CCCGGGGCTG GTGCGCGACC TGCTCCAGGC CGGCGAGCGC GCCGCCGACC GCGGCTGGGA GCTGCCCCTG GACGAAGAGT ACGATGAGCA GCTGCGCTCG CCCTTCGCCG ACGTGGCCAA TATCGGCGGA CAGCCGGCGG GCACCATCAC CGCTGGCTGC TTCCTGCAGC GCTTCGCCCG CGGGCTGCGC TGGGCGCACC TGGACATCGC CGGCACCGCC TGGAAGAGCG GCGAGCACAA GGGCGCCACC GGGCGGCCGG TTCCCCTGCT CACCCACTTC CTCGCCGGCC GCGCCGGCTG GACGCTGTGA
|
Protein sequence | MELKTKSGDP ARQRTACVVV GVYERRRMSE AARAVDAASD GYLSHLLRRG DLEGEAGQTL LLPDCPGVRT DRVLLVGCGR ERDFNERTYR KAVTAAARAL EQAGTGEAIL FLPELPVRGR DVAWRVAATA EILETTLYRF DTYKSDPRPP RRPLRQATLA VPRRADLRRA QPALTLGQAA GRGANFSRDL GNTPANICTP GYLGEQAEAL AQRFDGVRAE ILGPAELEEQ GLAALLAVAR GAEAPPRLVV LHYRGADDDQ APVALVGKGI TFDSGGISIK PSASMDEMKY DMSGAAAVFG AVHAAAEAQL PLNLVAVIPA TENMPDGRAT RPGDIIDSLD GQRIEVLNTD AEGRLVLADG LAYARRLEPS EVVDVATLTG AAIIGLGHHR HAVMGNAPGL VRDLLQAGER AADRGWELPL DEEYDEQLRS PFADVANIGG QPAGTITAGC FLQRFARGLR WAHLDIAGTA WKSGEHKGAT GRPVPLLTHF LAGRAGWTL
|
| |