Gene Information |
Locus tag | Tbd_0899 |
Symbol | pepN |
ID | 3672690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | - |
Start bp | 948937 |
End bp | 951714 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637709578 |
Product | aminopeptidase N |
Protein accession | YP_314657 |
Protein GI | 74316917 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
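The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 948937
end_bp = 951714

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 2778 bp and protein length of 925 aa.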
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.570947 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.595006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCACCG AAACCCCCGT CACGCTTTAC CTGAAGGATT ACGCCCCGCC CGCCTGGCGG ATCGAGTCGG CCGACCTGCA CGTCGCCATC CACGACGACC ATGCCGAGGT GCGTGCGCGC CTCGACTGCG TGCGCAACAC CGCGGGCGGC GACGACGCGC TGGTTCTGAA CGGCGAAGCG CTGGAACTCG TCGGCCTCAG CCTCGACGGC GCGCCGCTCG ACCCCGCGCG CTACGTGTAC GGCGACGATC TGCTGCGCAT CGCGGGGCCC CTGCCCGATC GCTGCGTGCT CGAAAGCGTC GTGCGCATCC GCCCCGATCT GAACACCCAG CTCTCGGGCC TCTACCGCTC GCAGGACGGC TATTTCACGC AATGCGAGCC CGAAGGCTTC CGCCGCATCA CCTTCTTCCC GGACCGGCCC GACGTGATGA CGAAATTCAG CTGCACGGTC GAAGCCGACC GCGCGCGGTT TCCGCATCTG CTGTCGAACG GCAATCCCGT GGCGGCGGGG GTCTGCGACG ACGACGCAGC GCGGCACTGG GCGCGCTGGG ACGACCCCTA CGCCAAGCCG TGCTACCTGT TCGCGCTCGT CGCCGCCAGG CTCGACGTGC TGGAGGACGA ATACGTCACC GCCTCGGGAC GCAAGGTCAG GCTCGCGGTC TACGTCGAGC CGGGCAAGCT CGACCAGTGC GGCCACGCGA TGGCCGCGCT GAAAAAAGCC ATGCGCTGGG ACGAAGAGCG CTTCGGCCTC GAATACGACC TCGACCAGTA CATGATCGTC GCCGTCGGCG ACTTCAACAT GGGGGCGATG GAGAACAAGG GCCTCAACAT CTTCAATACC AAATACGTGC TCGCCCGCCC CGACACCGCG ACCGACGCCG ACTACCAGGG CATCGACCGC GTGGTGGCGC ACGAGTATTT CCACAACTGG ACCGGCAACC GCGTCACCTG CCGCGACTGG TTCCAGCTGT CGTTGAAGGA AGGCCTCACG GTGTTCCGCG ACCAGGAATT CGGCGCCGAC ACGCACTCGC GCGCGGTCAC CCGCATCCAG GAAGTGCGCG CACTGCGCGT CGGCCAGTTC CCCGAAGACG TCGGGCCGAT GGCGCACCCG ATCCGGCCGG CCTCCTACGC CGAGATCAAC AACTTCTACA CCGCCACCGT CTACAACAAG GGCGCGGAAG TCATCCGCAT GATGCACACC CTGCTCGGGC GCGAGGCGTT CCGCCGCGGC ACGGATTTGT ACTTCGCGCG CCACGACGGC CAGGCCGTCA CCTGCGAGGA TTTCGTCGCA GCGATGCAGG ATGCTTCCGG GATCGACCTC GCGCAGTTCC GCCGCTGGTA CGCGCGCGCC GGCACGCCGC GCCTGAACGC ATCAAGCAGC TACGACGCCA CGACCCGACG CTACACGCTG ACGCTCACGC AAACCCTCGC GCCGACCGCC TACGAAAAGC GGCTGACCGA ATCGGGACAG GCCATCGTCG ACGGCACGCT GCACATCCCC GTCGCGCTCG GGCTGGTATT GCCAAATGGT AATGACGCGC CGCTGAAACT CGCCGGTGAG GCCGAGGCGA GCGGCACCAC GCGCGTCCTC TCGCTGACCG AGCCCACGCA GACCTTCGTC TTCGAGGACA TTCCCGCCGC CCCCGTCGCC TCGCTGCTGC GCGATTTTTC GGCGCCGGTG CAGTTGGAGT TCGAACAGAC CGACGCCGAG CTCGCGCACC TGATGGCACA CGATGACGAC GCCTTCAACC GCTGGGAAGC CGGGCAGCGG CTGGCGACGC GGGTGCTGCT CGCCGGTATT 
GCCGCGCAGC AACGCGCAGC GTTGATGCGC CATACTCTCC CGCATGCGGG AGAGGACGAC TCGTGGCGTG GCGCCGGCAG CGACTGGATT CCCGACGCCT TCGTCGCCGC CTGCGGCCGC GTGCTCGACG CCGGCCTCGC CGGCGACCCG GCGCTGGCTG CCGAAGCGCT CAACCTGCCC GCCGAAGCCG TGCTCGCCGA AGCGGTGGTA TCGCTTGGCC AGCCCATCGA CCCCGAAGCC ATCCACGCCG CGCGCGTGGC CCTGCGCCGC CACCTCGCCG CCCGTCTGCG CGACGCCTTC GAAGCCGCTT GGGCCGCCCT CACCCCAACT GCGGCCTATG CGCCCGACGG CGCCCAGGTC GGCCAGCGCG CCCTGCGCAA CGCCTGCCTC GGCTACCTCG CCGAAAGCGA CGTCGAGTAT TTGCAGAGCG CGGTCGTCCC GCGCCTCACC GCCCAGCTCG CGGCGGGCGG CAACATGACC GACGGCAACA TGACCAGTCA AATGGCCGCG CTGGCCACGC TCGCCAACCT CGACCTGCCG GAAAGGGAAG CGACGCTCGC CGATTTCTAC ACGCGCTGGC AAAACGAAGC CCTGGTCGTC GACAAGTGGT TTGCCGTGCA AGCCACCTCG CGCCTGCCAG GCACCGCCGC GCGCGTGCGC GCGCTGATGC AGCATCCGGC GTTTGACCTC AAGAACCCCA ACCGCGTCTA TGCCCTGATC CGCGGCTTCT GCGGCGCCAA CCCGCGCCAC TTCCACGCCT TTGATGGCAG CGGCTACGCG CTGGCGGCCG ACGTCATCAG CGAATTGCAG GCCATCAACC CCCAGGTCGC GTCACGCATC GCGCGCAGCT TCGATCGCTG GCGGCAGTTC GACGCCGGTC GGCAGGCACA CGCGCGCGTG GCGCTGGAAC GTATCGCGGA GATCGAGGAT CTAGCGAAGG ACGTGGCGGA AGTTGTGGGG AATGCGCTAA AGGGTTGA
|
Protein sequence | MRTETPVTLY LKDYAPPAWR IESADLHVAI HDDHAEVRAR LDCVRNTAGG DDALVLNGEA LELVGLSLDG APLDPARYVY GDDLLRIAGP LPDRCVLESV VRIRPDLNTQ LSGLYRSQDG YFTQCEPEGF RRITFFPDRP DVMTKFSCTV EADRARFPHL LSNGNPVAAG VCDDDAARHW ARWDDPYAKP CYLFALVAAR LDVLEDEYVT ASGRKVRLAV YVEPGKLDQC GHAMAALKKA MRWDEERFGL EYDLDQYMIV AVGDFNMGAM ENKGLNIFNT KYVLARPDTA TDADYQGIDR VVAHEYFHNW TGNRVTCRDW FQLSLKEGLT VFRDQEFGAD THSRAVTRIQ EVRALRVGQF PEDVGPMAHP IRPASYAEIN NFYTATVYNK GAEVIRMMHT LLGREAFRRG TDLYFARHDG QAVTCEDFVA AMQDASGIDL AQFRRWYARA GTPRLNASSS YDATTRRYTL TLTQTLAPTA YEKRLTESGQ AIVDGTLHIP VALGLVLPNG NDAPLKLAGE AEASGTTRVL SLTEPTQTFV FEDIPAAPVA SLLRDFSAPV QLEFEQTDAE LAHLMAHDDD AFNRWEAGQR LATRVLLAGI AAQQRAALMR HTLPHAGEDD SWRGAGSDWI PDAFVAACGR VLDAGLAGDP ALAAEALNLP AEAVLAEAVV SLGQPIDPEA IHAARVALRR HLAARLRDAF EAAWAALTPT AAYAPDGAQV GQRALRNACL GYLAESDVEY LQSAVVPRLT AQLAAGGNMT DGNMTSQMAA LATLANLDLP EREATLADFY TRWQNEALVV DKWFAVQATS RLPGTAARVR ALMQHPAFDL KNPNRVYALI RGFCGANPRH FHAFDGSGYA LAADVISELQ AINPQVASRI ARSFDRWRQF DAGRQAHARV ALERIAEIED LAKDVAEVVG NALKG
|
| |
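The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last few codons. This is a rough sketch rather than the database's own method: the helper names `translate` and `gc_fraction` are hypothetical, the codon table is written out by hand, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons). It also assumes the sequence is shown in coding orientation, which is consistent with it beginning with ATG even though the gene lies on the minus strand.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna):
    """Remove the spaces that separate the 10-base blocks and upper-case."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases and last 18 bases of the gene sequence shown above.
head = translate("ATGCGCACCG AAACCCCCGT C")
tail = translate("AATGCGCTAA AGGGTTGA")
print(head, tail)
```

The two fragments translate to the beginning (MRTETPV) and end (NALKG) of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would be one way to check the reported GC content.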