Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1608 |
Symbol | pepN |
ID | 3847927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 1806630 |
End bp | 1809389 |
Gene Length | 2760 bp |
Protein Length | 919 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637841278 |
Product | aminopeptidase N |
Protein accession | YP_442146 |
Protein GI | 83719324 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.535966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTCTT GCCGCCGCGC CGCGCGCCCG GCGCGGACCC AACCTTATCC TATGCCCACG ATCGCCATGT CCGACACCGC CGCTCCCTCC GCCGTGATCC GCCGCAGCGA CTACACGCCG CCTGCGTTCC TGATCGATAC CGTCGCGCTC GAGTTCGATC TCGAGCCCGC GCGCACCATC GTCAAGAACA CGATGCGCGT GCGCCGCAAT CCGGATGCCG CGCCCGCGCC GCACTTCGAG CTGATGGGCG AGGCGCTCGT GCTCATCGGC GCGCATCTCG ACGGCAAGCC GTACGACGCG GTGCGCGCGC ACGAGCATGG CGTGACCGTC GAGAACGTGC CCGACGCGTT CGAGCTGACG ATCGAGAACG CGTGCGCGCC CGAATCGAAC ACGACGCTCT CGGGCCTGTA CGTCTCGAGC GGCAATTTCT TCACCCAGTG CGAGGCGGAG GGCTTCCGGC GCATCACCTA CTTCGTCGAC CGCCCGGACG TGATGGCGTC GTACACGGCC ACGCTGCGCG CCGACAAGGC CGCGTATCCG GTGCTGCTGT CGAACGGCAA TCTCGTCGAT TCCGGCGATC TGCCGGACGG CCGCCACTTC GCGAAGTGGG AAGACCCGTT CAAGAAGCCG AGCTACCTGT TCGCGCTCGT CGCGGGCAAG CTCGTCAAGC TCGAGGAAAC GATCAAATCG GCGAGCGGAA AGGACAAGCT GCTGCAGGTG TGGGTCGAGC CGCAGGATCT CGACAAGACC CGCCACGCGA TGGATTCGCT GATCCATTCG ATCCGCTGGG ACGAACGGCG CTTCGGCCTC GAGCTCGATC TCGACCGCTT CATGATCGTC GCCGTCGGCG ATTTCAACAT GGGCGCGATG GAAAACAAGG GGCTCAACAT CTTCAACACG AAGTACGTGC TCGCGAACCC GGAAACGGCG ACCGACGTCG ACTTCGCGAA CGTCGAATCG GTCGTCGGCC ACGAGTATTT CCACAACTGG ACAGGCAACC GCGTGACCTG CCGCGACTGG TTCCAGTTGA GCCTGAAGGA AGGCCTCACC GTGTTCCGCG ACCAGGAGTT CTCGGCCGAC ATGTCCGCGG GCGCCGAAGA CGACGCCGCC GCGCGCGCGG TCAAGCGCAT CGAGGACGTG CGCGTGCTGC GCCAGCTCCA GTTCGCCGAG GACGCCGGCC CGATGGCGCA TCCGGTGCGG CCCGAGAGTT ACGTCGAGAT CAACAACTTC TACACGATGA CCGTCTACGA GAAAGGCGCG GAAGTCGTGC GGATGTATCA GACGCTGTTC GGCCGCGACG GCTTTCGCAA GGGGATGGAC CTGTACTTCC GGCGCCACGA CGGGCAGGCC GTCACGTGCG ACGACTTCCG CCACGCAATG GCCGACGCGA ACGGCCGCGA TCTCGCGCTG TTCGAGCGCT GGTACAGCCA GGCGGGCACG CCGCGCGTGA CGGTTCGCAC CGCTTACGAC GCCGCCGCGA AGCGCTACGC GGTGACGCTG CGGCAAGGCT ACGGCGATGC CGCGCGCGAC ACGCAGAAAG GACCGCTGCT GATTCCGTTC GCGATCGGCC TGATCGGCGC CGACGGCCGC GACCTGCCGC TGCGCCTCGA AGGCGAGGCG GCCACGTCGG GCACGACGCG CGTGCTCGAT CTGACCGAGA CCGAGACGAC GTTCACGTTC GTCGACGTCG ACGAAGCGCC GCTGCCGTCG CTGCTGCGCA ATTTCTCGGC GCCCGTGATC GTCGAATACG ACTACCGCGA CGACGAGCTC GCATTCCTGC TCGCGCACGA CGGCGATCCG TTCAACCGCT GGGAGGCGGG CCAGCGCCTC GCGACGCGCG CGCTGCTCAC GCTCGCCGCG CGCGCGGCCG CGCAGCAGCC GCTCGCGCTC GACGACGCGT TCGTCGCCGC GTTCAAGCGC GTGCTGACGA ACGACACGCT GTCGCCCGCG TTTCGCGAGC TCGCGCTCAC GCTGCCGTCG GAAGCCTATC TCGCCGACCA GATGACGCAA GCCGATCCGG CCGCCGTTCA TCGCGCGCGC CAGTTCGTGC GCCGTCAGCT CGCGAACGCG CTGCGCGCCG ACTGGCTCGC GGTCTACGAC CGCCATCAGA CACCGGGCGC CTATGCGCCG ACGCCCGACG ACGCTGGCCG TCGCGCGCTG AAGAACCTCG CGCTCGCCTA CCTCGCCGAG CTCGACGAAC CGGCGGACGC GATCCGTCTC GCCACCGCGC AATACGACGC CGCGAACAAC ATGACGGATC GCGCATCCGC GCTCGTCGCG CTGCTGTCGG CCGCCGCCGC CTCGGCCGAC GCGGCGCGCG CCGCCGATCG TGCGCTCGAG GACTTCTATC GCCGCTTCGA GAAGGAAGCG CTCGTGATCG ACAAGTGGTT CTCGATGCAG GCCACCCGGC GCGGCACGCC CGAGCATCCG ACGCTCGACA TCGTGCGCAA GCTGCTCGCG CATCCTGCGT TCAACCTGAG GAATCCTAAC CGCGCGCGAT CGCTGATCTT CGGCTTCTGC TCGGCGAATC CGGCGCAGTT CCATGCGGCC GACGGCTCGG GCTACGCGTT CTGGGCCGAT CAGGTGCTCT CGCTCGACGC GCTCAATCCG CAGGTTGCCG CGCGGCTCGC GCGCGCGCTC GAGCTGTGGC GCCGCTTCAC GCCGTCGCTG CGCGACAAGA TGCGCGAAGC GCTCGAACGC GTCGCCGCGA ACGCGCAGTC GCGCGACGTG CGGGAGATCG TCGAGAAGGC GCTCGCCTGA
|
Protein sequence | MGSCRRAARP ARTQPYPMPT IAMSDTAAPS AVIRRSDYTP PAFLIDTVAL EFDLEPARTI VKNTMRVRRN PDAAPAPHFE LMGEALVLIG AHLDGKPYDA VRAHEHGVTV ENVPDAFELT IENACAPESN TTLSGLYVSS GNFFTQCEAE GFRRITYFVD RPDVMASYTA TLRADKAAYP VLLSNGNLVD SGDLPDGRHF AKWEDPFKKP SYLFALVAGK LVKLEETIKS ASGKDKLLQV WVEPQDLDKT RHAMDSLIHS IRWDERRFGL ELDLDRFMIV AVGDFNMGAM ENKGLNIFNT KYVLANPETA TDVDFANVES VVGHEYFHNW TGNRVTCRDW FQLSLKEGLT VFRDQEFSAD MSAGAEDDAA ARAVKRIEDV RVLRQLQFAE DAGPMAHPVR PESYVEINNF YTMTVYEKGA EVVRMYQTLF GRDGFRKGMD LYFRRHDGQA VTCDDFRHAM ADANGRDLAL FERWYSQAGT PRVTVRTAYD AAAKRYAVTL RQGYGDAARD TQKGPLLIPF AIGLIGADGR DLPLRLEGEA ATSGTTRVLD LTETETTFTF VDVDEAPLPS LLRNFSAPVI VEYDYRDDEL AFLLAHDGDP FNRWEAGQRL ATRALLTLAA RAAAQQPLAL DDAFVAAFKR VLTNDTLSPA FRELALTLPS EAYLADQMTQ ADPAAVHRAR QFVRRQLANA LRADWLAVYD RHQTPGAYAP TPDDAGRRAL KNLALAYLAE LDEPADAIRL ATAQYDAANN MTDRASALVA LLSAAAASAD AARAADRALE DFYRRFEKEA LVIDKWFSMQ ATRRGTPEHP TLDIVRKLLA HPAFNLRNPN RARSLIFGFC SANPAQFHAA DGSGYAFWAD QVLSLDALNP QVAARLARAL ELWRRFTPSL RDKMREALER VAANAQSRDV REIVEKALA
|
| |