Gene Information |
Locus tag | Csal_2521 |
Symbol | pepN |
ID | 4026927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2825444 |
End bp | 2828083 |
Gene Length | 2640 bp |
Protein Length | 879 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637967728 |
Product | aminopeptidase N |
Protein accession | YP_574567 |
Protein GI | 92114639 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
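The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that does not appear in the protein; the variable names are illustrative and are not part of the record.

```python
# Values copied from the "Start bp" and "End bp" rows of the record.
start_bp = 2825444
end_bp = 2828083

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced | No")
# and ends with one stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2640, matching "Gene Length | 2640 bp"
print(protein_length)  # 879, matching "Protein Length | 879 aa"
```

Under these assumptions the three fields agree: 2640 bp is 880 codons, of which 879 encode amino acids.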
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGAAC CGCAAGCGAC TCATCTCAAG GACTACCTGC CGCCCGCCTA CCGGGTGACC CATACCGAAC TGACTTTCGA CCTGGCCCCC TCGGCGACGA CCGTGACGGC GCGGCTGCAT GTCGAGCGTC ACCCCGAGCG CGAGGCGGGA TTGCCGCTGC GGTTTGCCGG CGACAAGCTG TCGCTGGAAC GTATCGCCGT GGACGGTCAG ACATTGCAGG CGGATGAGTA CCAGGTCGAC GACGAGGGCC TGACCATCCC CACCGTGCCG GAGCGCTTCC TGCTGGACAC TCAGGTGTCG ATCGACCCCG CGGCCAATAC CGCGCTCGAG GGGCTGTATC TCTCCAACGG CATGTTCTGC ACCCAGTGCG AGCCGGAAGG CTTTCGGCGG ATCACCTTCT ATCCCGACCG CCCGGACGTC ATGGCGACCT TCTCCACCAC GGTGGTGGGC GACAAGGAGA CGCTGCCGGT GTTGCTCTCC AACGGCAATC CGGTCGAGCG CGGCGAATTG CCGGGCGAGC GTCACTTCGT CACCTGGGAG GATCCGCACC CCAAGCCCAG TTATCTGTTT GCGCTGGTGG CGGGCAAGCT CGAGAAGGTC GAGGACCACT TCACGACCAT GAGTGGCCGC GACGTCACGT TGCAGATCTG GGTCGAGCCG GAGAACCTGG ACAAGACCGA CCATGCCATG GCCTCGCTCA AGCGCGCCAT GCAGTGGGAT GAAGAGACCT ACGGGCGTGA ATACGACCTC GATCTGTTCA TGATCGTGGC CGTCAACGAC TTCAACATGG GCGCCATGGA GAACAAGGGG CTCAACATCT TCAACTCGGC GGCGGTCCTC ACGCATCCGC AGACCGCCAC CGATGCGGCC TTCCAGCGGG TGGAGGGCAT CGTCGCGCAC GAGTACTTCC ATAACTGGTC GGGCAATCGC GTGACCTGCC GTGACTGGTT CCAGCTCTCC CTGAAGGAAG GCTTCACGGT CTTCCGCGAC CAGACCTTCA GCGCCGACAC CAACTCCGCG CCGGTCAAGC GCATCGAGGA CGTGTCCTTC TTCCGCACGG CCCAGTTCGC CGAGGACGCG GGGCCGACCG CCCACCCCGT GCGTCCGGAC CATTACATCG AGATCTCCAA CTTCTATACC CTGACGATCT ACGAGAAGGG CGCCGAGATC GTGCGCATGC TGCGCAACCT TCTCGGCTGG GAGACGTTCC GTCAGGGGTC GGATCTCTAT TTCGCGCGTT TCGACGGCCA GGCCGTGACC ATCGAGGACT TCGTCGATTG CATGGCGGAG GTTTCGGGGC TCGATCTCGA TCAGTTCATG CGCTGGTACT CCCAGGCGGG CACGCCGGAG ATCGACGCCC ATGGCGAGTA CGACTACGCC AAGTGCGAGT ACCACTTGCG GTTGTCGCAA CGCACGCCGC CCACGCCCGG CCAGGCGGAG AAGGCGCCTT TGCATATCCC GGTGCGCATG GGACTGGTGG GTACCAAGTC CGGACGTGAC CTGAGCCTGA CGCTGGATGG CGAGGCGCTG GGCACCGAAA CGGTGCTGCA CCTGCGTGAC AGCGAGCAGA CCTATGTCTT CACCGGCATC GACGAAGCCC CGACGCCGTC GCTGCTGCGC GGATTCTCGG CCCCGGTCTA CCTGCGCTAC CCGTATTCGC GCGAGGATCT GTCCTTCCTG CTGACCCACG ACGCGGACGA CTTCAACCGC TGGGACGCGG GACAACGGCT CACCATGCTG GCGCTCGACG ACCTCATCGC CGCGCACCGC AACGGCGTGG AGAAAGTCAT GGATGGTCGC 
GTCATCGACG CGTACCGTCG CCTGTTGACC ACCGAGACCG ATGACAAGGC GGTGCTGGCC GAAATGCTGA CGTTGCCCTC GGAGGCCTAT ATCGCCGAGC AGCAGCCGAT GGTCGACGTG GACGCCATCC ACGCCGCGCG GGACTTCGTG AAGCAGTCAT TGGCCACGGC ATTGCGCGAT GAGTTCCTGG CGCTCTACGA GGCGCATCGT AGCGACGCGC CCTACGCCCC CGAGCCCGAA CAGATCGCGC AGCGTCGCGT GAAGAACGTG GCGCTCGAAT ACTTGATGAG CATCGAGGAC GAGCAGGGCA TTGCGCTGGC CAACGCGCAG GTCGAAGCCG AGGACAACAT GACCGATGTC CGGGCCGCGT TGACGATGCT GGTGCACAGC TCGCGGACCG ATCTCGCCGA ACCGGCGCTC AAGGCCTTCG GCGAGAAATG GGCACACGAT CCGCTGGTCA TGAACCAATG GTTCACCATC CAGGTCACGC GGCCCCAGGC GGACGTTCTC GAGCGGGTCA AGTTCCTCAT GGCGCATCCG TTGTTCTCGC TCAAGAACCC CAACAAGGTG CGGGCGTTGA TCGGGGCGTT CGCGGCGCAG AACCGCGTCA ATTTCCATCG TCTCGACGGC GAGGGCTATC GCCTGCTGGC CGATGTGGTG ATCGAGCTCA ACCGGCTCAA TCCCGAGATC GCGGCGCGGA TCATCACGCC GTTGACGCGC TGGCAGCGTT TCGACGAGCA GCGTCAGGCC CTGATGAAGG CCGAGCTCGA GCGCATTCGC GCCGAGGAGC TCTCGCCCAA CGTCTTCGAA ATGGTCGAGC GCGCGCTGGC CGACGCCTGA
|
Protein sequence | MSEPQATHLK DYLPPAYRVT HTELTFDLAP SATTVTARLH VERHPEREAG LPLRFAGDKL SLERIAVDGQ TLQADEYQVD DEGLTIPTVP ERFLLDTQVS IDPAANTALE GLYLSNGMFC TQCEPEGFRR ITFYPDRPDV MATFSTTVVG DKETLPVLLS NGNPVERGEL PGERHFVTWE DPHPKPSYLF ALVAGKLEKV EDHFTTMSGR DVTLQIWVEP ENLDKTDHAM ASLKRAMQWD EETYGREYDL DLFMIVAVND FNMGAMENKG LNIFNSAAVL THPQTATDAA FQRVEGIVAH EYFHNWSGNR VTCRDWFQLS LKEGFTVFRD QTFSADTNSA PVKRIEDVSF FRTAQFAEDA GPTAHPVRPD HYIEISNFYT LTIYEKGAEI VRMLRNLLGW ETFRQGSDLY FARFDGQAVT IEDFVDCMAE VSGLDLDQFM RWYSQAGTPE IDAHGEYDYA KCEYHLRLSQ RTPPTPGQAE KAPLHIPVRM GLVGTKSGRD LSLTLDGEAL GTETVLHLRD SEQTYVFTGI DEAPTPSLLR GFSAPVYLRY PYSREDLSFL LTHDADDFNR WDAGQRLTML ALDDLIAAHR NGVEKVMDGR VIDAYRRLLT TETDDKAVLA EMLTLPSEAY IAEQQPMVDV DAIHAARDFV KQSLATALRD EFLALYEAHR SDAPYAPEPE QIAQRRVKNV ALEYLMSIED EQGIALANAQ VEAEDNMTDV RAALTMLVHS SRTDLAEPAL KAFGEKWAHD PLVMNQWFTI QVTRPQADVL ERVKFLMAHP LFSLKNPNKV RALIGAFAAQ NRVNFHRLDG EGYRLLADVV IELNRLNPEI AARIITPLTR WQRFDEQRQA LMKAELERIR AEELSPNVFE MVERALADA
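The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA even though the gene is on the minus strand), so it can be translated directly. The sketch below is one way to check the two sequence rows against each other, assuming NCBI translation table 11, whose codon-to-amino-acid assignments match the standard code. The `translate` helper and the `head` and `tail` excerpts are illustrative; only the first and last three 10-base blocks of the gene sequence are used to keep the example short.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code; it differs only in which
# codons may act as start codons, which does not matter here (start is ATG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate DNA written in space-separated blocks; '*' marks a stop."""
    dna = "".join(blocked_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last 30 bases of the gene sequence, copied from the record.
head = "ATGTCCGAAC CGCAAGCGAC TCATCTCAAG"
tail = "ATGGTCGAGC GCGCGCTGGC CGACGCCTGA"

print(translate(head))  # MSEPQATHLK
print(translate(tail))  # MVERALADA*
```

The two results match the first block and the final block of the protein sequence, with the trailing `*` corresponding to the TGA stop codon.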