Gene Information |
Locus tag | TM1040_0706 |
Symbol | pepN |
ID | 4076983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 756633 |
End bp | 759197 |
Gene Length | 2565 bp |
Protein Length | 854 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638006003 |
Product | aminopeptidase N |
Protein accession | YP_612701 |
Protein GI | 99080547 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
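The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration and makes two assumptions: that Start bp and End bp are 1-based, inclusive coordinates, and that the final codon of the CDS is a stop codon that is not counted in the protein length (the listed gene sequence does end in TGA). The variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 756633
end_bp = 759197
gene_length_bp = 2565
protein_length_aa = 854

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; a remainder of 0 means a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and contributes no amino acid.
expected_protein_aa = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches Protein Length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 2565 bp gives 855 codons, and dropping the stop codon leaves 854 residues.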
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.281728 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAGACG CCGCCCCAAG CACGCAGCCA GAGACCTTCT ACCTGAAGGA CTACACCCCG TTTGGCTATG TGGTGGAGCG CGTCGAGCTG GTATTTCGTC TCTCGCCCGA GGCAACACGG GTGCTGTCCA AGATCCGTTT TGCACCCAAC CCTGATGTCT CAGATCGGGT GTTCTTTCTG CATGGCGAGA AGCTTAAGCT GATTTCAGCA CAGATTGATG GGGAAGAGGT CGCGCCCAAT GTCACCGACA CGGGCCTCAG CTGCGAGGTG CCGGACACGC CCTTCACATG GGAAGCAGAG GTCGAGATCA ACCCAAAGGC CAATACCGCG CTCGAGGGGC TTTATATGTC GAATGGCATG TATTGCACCC AATGCGAGGC CGAAGGGTTC CGCAAGATCA CCTATTATCC CGACCGCCCC GATGTCATGA GCACCTTCAA TGTCCGTATC GAGGGCGACG AAAAGGTGAT GCTGTCCAAC GGCAACCCTG GCGAAAGCGG TGAGGGCTTT GCCGAATGGC ACGATCCCTG GCCCAAACCA GCCTATCTCT TTGCATTGGT GGCGGGCGAT CTGGTCAACC ATCCCGATCG TTTCACCACG CGCTCGGGCA AGGATGTGGA ACTCAATATC TGGGTGCGGC CCGGTGACGA GGGCAAATGC GCCTTTGGGA TGGAGGCGCT CAAGAAGTCG ATGACCTGGG ACGAAGAGGT CTATGGGCGT GAGTACGATC TCGACGTGTT CAACATCGTT GCAGTCGATG ACTTCAACAT GGGCGCAATG GAGAACAAAG GATTGAACAT TTTCAACTCC TCCTGCGTTC TGGCCAGCCC TGAAACCTCG ACAGATGCCA ATTTCGAACG GATCGAGGCG ATCATCGCCC ATGAGTATTT TCACAACTGG ACCGGCAACC GGATCACCTG CCGGGACTGG TTCCAGCTGT GCCTCAAGGA AGGTCTGACA GTCTATCGCG ATGCGCAGTT CACCGCCGAC ATGCGCTCTG CTCCGGTGAA ACGGATCGAA GACGTGATCG AGTTGCGCGC GCGCCAGTTC CCCGAGGACA ACGGTCCGCT CGCCCACCCT GTCCGCCCCG AAGCCTTCCA AGAGATCAAC AATTTTTACA CCGCAACGGT CTATGAGAAG GGTGCCGAGG TGATCGGCAT GCTCAAACGT CTGGTCGGTG ATGAGGCCTA TTACAAGGCG CTCGATCTCT ATTTTGATCG TCACGATGGG CAGGCCTGCA CCATCGAGGA CTGGATCAAG GTGTTCGAGG ACTCCACGGG ACGAGATCTG GCACAATTCA AGAATTGGTA CAGCCAGGCC GGCACGCCGC GCCTCTCGGT CGAAGAGAGC TTTGAGGATG GCACCTACAC GCTCACCTTC CGGCAGATGA CTCCCCCCAC ACCTGGACAG GACCACAAGG ATCCAAAGGT GATCCCGATT GCGGTGGGGC TGCTCAGCCC CACTGGCGAC GAAGTGCTGC CAACCACCGT GCTCGAAATG ACAGAGGCAG AGCAGAGCTT TTCCTTTGAA GGCTTCAAGA CGCGACCGAT CCCGTCGATC TTGCGCGATT TCTCTGCTCC GGTGATCCTG ACCCGCGAAA GCTCTGCCAA AGAACGCGCT TTCTTGCTGG CACATGACAC CGACCCCTTT ACCCGTTGGG AAGCGGGCCG TGAGCTTGCC AAAGCCTCTC GCATCGCGAT GGTAACCGAC GGTGCCAGCC CAGACTCGAA CTATCTTGAG GCGCTGCAGT CCCTGGTGCG CGACGATCAC CTTGATCCGG CCTTTCGCGC CCTGGTTCTG 
GCACCTCCCA CCGAGAGCGA GATTGCACAG GCACTGGCAG ATCAGGGCGT AACACCCGAC CCGGACGCAA TCCACGACGC GGCAGAGACC TTTGCGCAGA CATTGGCACA GAGCCTGTCC GACAGCCTCC CACGCCTGTT TGCAGCCACT TTGGTCGATG GCGCCTATGT GCCGGATGCC AAGGGCGCCG GACTGAGGGC GCTCAACGGG CGTATCCTTG GACTGTTGAC CCGGATTGAT GGGGGCGAGG CCGCGACCAA GCAATTTGAG ACCGCCAACA ATATGACCGT GCAGAACTCG GCGCTGGCTT GCCTGCTCAA GGCCGAGAAG GGTGACGCGC AATCACAGGC CTTCTTTGAG CAGTGGCAAG ATGATCGTCT GGTAATGGAC AAGTGGTTCG GGCTTCAGGT GGCCACGGCC CGCCCCGAAC GCGCACCTGC CATCGCCCAG AGCCTGACCG AACATCCGCT GTTCACGATC AAGAACCCCA ACCGTTTTCG GGCTGTGATG GGGGCGTTGG CAATGAACCA TGCCGGGTTC CACAAGGCCG ACGGCAGCGG ATATCGCTTG CTCGCCGATC AGTTGATCGC GCTGGACCCG CTGAACCCGC AGACCACCGC GCGTATGTGC AGCGCCTTCC AAACATGGAA GCGCTATGAT GCCGGACGTC AGGACAAGAT CAGAGCAGAA CTCAAGCGGA TCAAGGCAAC CGAGGGGCTG AGCCGGGATA CCAACGAGAT GGTGAGCCGT ATCCTTGACG CCTGA
Protein sequence | MKDAAPSTQP ETFYLKDYTP FGYVVERVEL VFRLSPEATR VLSKIRFAPN PDVSDRVFFL HGEKLKLISA QIDGEEVAPN VTDTGLSCEV PDTPFTWEAE VEINPKANTA LEGLYMSNGM YCTQCEAEGF RKITYYPDRP DVMSTFNVRI EGDEKVMLSN GNPGESGEGF AEWHDPWPKP AYLFALVAGD LVNHPDRFTT RSGKDVELNI WVRPGDEGKC AFGMEALKKS MTWDEEVYGR EYDLDVFNIV AVDDFNMGAM ENKGLNIFNS SCVLASPETS TDANFERIEA IIAHEYFHNW TGNRITCRDW FQLCLKEGLT VYRDAQFTAD MRSAPVKRIE DVIELRARQF PEDNGPLAHP VRPEAFQEIN NFYTATVYEK GAEVIGMLKR LVGDEAYYKA LDLYFDRHDG QACTIEDWIK VFEDSTGRDL AQFKNWYSQA GTPRLSVEES FEDGTYTLTF RQMTPPTPGQ DHKDPKVIPI AVGLLSPTGD EVLPTTVLEM TEAEQSFSFE GFKTRPIPSI LRDFSAPVIL TRESSAKERA FLLAHDTDPF TRWEAGRELA KASRIAMVTD GASPDSNYLE ALQSLVRDDH LDPAFRALVL APPTESEIAQ ALADQGVTPD PDAIHDAAET FAQTLAQSLS DSLPRLFAAT LVDGAYVPDA KGAGLRALNG RILGLLTRID GGEAATKQFE TANNMTVQNS ALACLLKAEK GDAQSQAFFE QWQDDRLVMD KWFGLQVATA RPERAPAIAQ SLTEHPLFTI KNPNRFRAVM GALAMNHAGF HKADGSGYRL LADQLIALDP LNPQTTARMC SAFQTWKRYD AGRQDKIRAE LKRIKATEGL SRDTNEMVSR ILDA