Gene Information |
Locus tag | Hhal_0120 |
Symbol | pepN |
ID | 4710619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 134820 |
End bp | 137531 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639854578 |
Product | aminopeptidase N |
Protein accession | YP_001001716 |
Protein GI | 121996929 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
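The coordinate and length fields above are mutually consistent, and the relationship can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (Hhal_0120, pepN).
length = gene_length_bp(134820, 137531)
print(length)                     # 2712
print(protein_length_aa(length))  # 903
```

Both results agree with the Gene Length (2712 bp) and Protein Length (903 aa) fields.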
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.269734 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCTCAG GGTGCCCGGA TCAGGTAAGC TGCTTATTCC ACACCGATCA CCACACTATA TTCAACTCGA GCGCCCCCAT GAGTGCATCC CGACCGCAGA CCATCCGCCT GCGCGACTAC CAGCCGCCAG CCTTCCTCGT CGAACGCATC CACCTCCGCG TCGACCTCAG CGGCGGCACC GCCCAGGTGG ATGCCACTCT GGATCTGCAC CGCAACCCGG CTGCCGACAA CGACGCCCCC CTGCGCCTGG ACGCCGAATT CCTGGATCTG GAGTCGCTGA CCCTGGACGG CGCCGAGCTG GAGCCGAGCA CGCTGCGCGA CGAACAGGGC ACGATCGTCC TCCATGATGT CCCGCAGCGC TGTCGGGTGG AGTCGGTCAG CCGCTTCGAC CCGGCAGCCA ACACGGCCCT CTCCGGTCTG TATCGCTCCG GCGGTATGTT CTGCACCCAG TGCGAAGCGG AAGGGTTCCG TCGCATCACG CCCTACCCCG ACCGCCCGGA CGTGCTGGCC CCGTTCACCA CCACGGTGGT CGCCGACCGC GCCACCTGTC CGGTACTGCT ATCCAATGGG GATTGCATCG ACCGGGGTGC GCTCGACGCG GAGCGCCACT ACGCCGTCTG GCACGACCCC TTCCCCAAGC CGTCGTACTT GTTCGCCCTG GTCGCCGGCG ACCTGGCCTG TCAGGAGGCC ACCTTCGTGA CCGCCTCGGG ACGCGAGGTC GCCCTGCACT TCTATGTCGA GCCCGAAAAC GCCGGACGCA CCGAGCACGC CTTGGCTGCC CTGCAGCGGG CCATGCGCTG GGACGAGACC CACTACGGCC TGGAGTACGA CCTGGACACC TACATGGTCG TCGCCGTCGG CGACTTCAAC ATGGGCGCCA TGGAAAACAA GGGGCTGAAT GTCTTCAACA CGCAGTTCGT CCTGGCCAGC CCCGACACCG CCACCGACGC CGATTACGAG AACATCGAGG CGGTCATCGG CCACGAGTAC TTCCACAATT GGACCGGCAA CCGGGTCACT TGCCGCGACT GGTTCCAGCT CTCCCTCAAG GAGGGTCTGA CCGTCTTCCG CGAGCACCAG TTCGCCGAGG CCATGGGCTC GGAGGCCGTC CAACGCATCG GCCAGGTCCG TCTGCTGCGC ACGGCCCAGT TCCCCGAGGA CGCTGGCCCC ATGAGCCATC CGGTCCGCCC CGACTCCTAC GTGGAGATCA ACAACTTCTA CACCGCAACC GTCTACGCCA AAGGCGCTGA AGTCATCCGC ATGTACCACA CGTTGCTCGG CGACGATGCC TTTCGACGCG GGGTCCAGCG CTACCTGCAA CGCCACGACG GAGAGGCCGC CACCATCGAG GATTTCCTCG CCGCCATGGA GGAGGCCGGC GGGCTGGATC TGCAGCAATT CGCACTGTGG TACACCCAGG CGGGCACTCC GCGGATCGAG GTCGAAGACC ACTACGACGC GGAGCACAAC ACCTACACCC TGATCTGCCG GCAGTCCCTG CCGGCCAGCC CCGGCCAGCC TCACAAGGAT CCGATGCACA TCCCGCTGGC CGTCGGGCTG CTCGGCCGAG ACGGCAGACC GCTCGCCGCC CGTCGACCGG ACGAGTCCAC TGCCCACGCC CACACCCGCG TCCTCGAACT ACGCGAGCCC GAGCAGCACT TTGTCTTTGA GCAGTGCCCG GAACGCCCCG TGCCCTCGCT GCTGCGGGGG TTCTCGGCAC CGGTCAAGCT GCACTATCCG TACACCGATG ACGACCTCGG CTTCCTGCTC GCCCACGACT CCGACCCATT CGCACGCTGG 
GAAGCGGGAC AGCAACTGGC GCTGCGCGTC CTCCTCGATG AGGCGCAGGG CACCCCACGC CCGGACGGCA TTGAACGGCT TGAGGAGGCG TTCCGCGCCT CTCTGGAAGC GCCCGACACC GACCCCGCCC TGGTCGCCGA AGCGCTGACC CTGCCCGGCG AGACCTACCT CGCCGAGCAG ATGGACGTCG TGGACCCCCA GGCGATCCAC GACGCCCGCC AGCGGGTCCG GGCCATTCTG GGCGAGGCAC TCGAGGGCCA CTGGCTGATC CTGCACAGTC AGCACAGCGG CCCGTGGCAC TACGAGCCCG ATGAGGTCGC CCGGCGCCGC CTGCGCAACC TGGCCCTCGG CTACCTGAAC GCCGGCTCGG AGCGCCACCT GAGCCGGGCA CTCGAGCAAC TCGAACGCAG CGACAATCTC ACCGATCGAC TCGCCGCGCT GGCGGTGCTC GCGGACAGTC CTCGGGCCGA GGCCGAGCAA GCGGTAGACG CCTTCTATCA GCGCTGGTCC GATGAACCGC TCGTAGTCGA TAAGTGGTTC CGGACCCAGG CGCTGGCCGA CCGGCCGGAC ACCGTTTACC GGGTCCATCA GCTCACCGAG CACCCGGACT TCACGCTGGA CAACCCAAAC CGGGCCCGCG CCCTACTGGG CGCGTTTGCC CAGGGCAACC CGGCCCACTT CCACAGCCCG GACGGCAGCG GTTATCGCCT CCTCGGCGAG CACGTCCTTC GCCTCGACCC GAACAACCCC CAGCTCGCCG CCCGGCTGCT GGCACCCCTG GCCCAGTGGC GCCGCTACAA CGCGAGCCGT CGCCACGCCA TGCGCGAGCA ACTCGAACGG ATCCTTGAGC GCGATGCGCT GTCGAAGGAT GTGTACGAGG TCGCTTCCAA AAGCCTGGGG GCGATGAGCT AA
|
Protein sequence | MCSGCPDQVS CLFHTDHHTI FNSSAPMSAS RPQTIRLRDY QPPAFLVERI HLRVDLSGGT AQVDATLDLH RNPAADNDAP LRLDAEFLDL ESLTLDGAEL EPSTLRDEQG TIVLHDVPQR CRVESVSRFD PAANTALSGL YRSGGMFCTQ CEAEGFRRIT PYPDRPDVLA PFTTTVVADR ATCPVLLSNG DCIDRGALDA ERHYAVWHDP FPKPSYLFAL VAGDLACQEA TFVTASGREV ALHFYVEPEN AGRTEHALAA LQRAMRWDET HYGLEYDLDT YMVVAVGDFN MGAMENKGLN VFNTQFVLAS PDTATDADYE NIEAVIGHEY FHNWTGNRVT CRDWFQLSLK EGLTVFREHQ FAEAMGSEAV QRIGQVRLLR TAQFPEDAGP MSHPVRPDSY VEINNFYTAT VYAKGAEVIR MYHTLLGDDA FRRGVQRYLQ RHDGEAATIE DFLAAMEEAG GLDLQQFALW YTQAGTPRIE VEDHYDAEHN TYTLICRQSL PASPGQPHKD PMHIPLAVGL LGRDGRPLAA RRPDESTAHA HTRVLELREP EQHFVFEQCP ERPVPSLLRG FSAPVKLHYP YTDDDLGFLL AHDSDPFARW EAGQQLALRV LLDEAQGTPR PDGIERLEEA FRASLEAPDT DPALVAEALT LPGETYLAEQ MDVVDPQAIH DARQRVRAIL GEALEGHWLI LHSQHSGPWH YEPDEVARRR LRNLALGYLN AGSERHLSRA LEQLERSDNL TDRLAALAVL ADSPRAEAEQ AVDAFYQRWS DEPLVVDKWF RTQALADRPD TVYRVHQLTE HPDFTLDNPN RARALLGAFA QGNPAHFHSP DGSGYRLLGE HVLRLDPNNP QLAARLLAPL AQWRRYNASR RHAMREQLER ILERDALSKD VYEVASKSLG AMS
|
| |
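The correspondence between the gene sequence and the protein sequence can be spot-checked with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The helper name `translate` and the table construction are illustrative, and only the first and last few codons of the record are used rather than the full sequence.

```python
# Codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGTGCTCAG GGTGCCCGGA TCAGGTAAGC"))  # MCSGCPDQVS
# Last 12 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GCGATGAGCTAA"))                      # AMS
```

The two outputs match the first ten and last three residues of the protein sequence listed above.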