Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_3622 |
Symbol | pepN |
ID | 3689958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 3943391 |
End bp | 3945631 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637730077 |
Product | peptidase |
Protein accession | YP_334987 |
Protein GI | 76811949 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.178496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACGGCGA AGCCGAATTA TTGCCCGTCG AATTCATGCA GGACCGAACG ATTTCTTTTC AATTCGAGGT TCAGTATGTG GATAAACAAT CGTAGATTGC GATATCTGTC CCTGCTCGGC GCGCTCGCGC TCGCCGCGTG CGGCGGCGAC GACGGCGGCA CCGGCTCGGC CGTGTCGCTC GGCGCCGCGC ACAGCCCGTC GAACGGCGCA AACGGCGCAA ACGGCACGAA CGGCGGCGCG GCCACGCCCG CCGCCGTCGA CAAAAGCACG AAGCCCGTCG AAATGCCCGA CACCGTCGTG CCCGTCAACT ACAAGCTCTG GTTCCGCCCG AACGCGGACC TGAATCAATT CAGCGGCCGC GCGGACGTCG AGATCAAGGT GCTCAAGCCC GTCAACGCGA TCGTCGTCGC GGGCCACCGG ATCCAGTTCA CGAACGGCAA GACCACGCTG CAGCCCGGCA ACGTGCAACT GGTCGCGACG CCGCAGGACA AGGGCGATTT CTATCAACTG CGTCCCGCGA GCGGCCAGAT CGCCCCGGGC AACTACTCGC TGCACATGGA GTGGCAGGGG ATCATCAACT TCAAGTCGTA CGACGACCCG GTCAATCACA CGGGCGGCAG CTGCGGCAAC GATCCATATC CGGGCTGCTC GGCGGCCGAG GGGATCTTCC GCGTCGACCT GAAGAGCACC GACGGCACGA CGAGCGGCGC GATCCTCACG CAAGGCGAGA CGAACCTGTC GCGCCAATGG TTCCCGGGCT GGGACGAGCC CGCGTTCCGT CCGACCTATG AAGTGACGGC CGAAGTGCCG CAGGCATGGC GCGTCGTGTC GAACGCGGCC GAACTGCCGT CGGTAAACGT CGGCGGCGGC TACAAGCTGG TGTCGTTCGA GAAGACGCCG CCGATGCCGT CGTATCTGCT GTTCTTCGGC GGCGGCCTGT TCGACGTGCT CGAAGACGAC TTCTCGAGCC CGCTGCCGAA CAGCAACGGC CTGCACCTGC GCATCTTCAC GCCGCCCGGC ATGCGCGAAT GGGCACGCCC CGCGATGCAA CGCACGAAGC AGGCGCTCGA TTACTACTAT CGCTACACCG GCATTGCGCT GCCGCTCAAG AAGTTCGACA CGGTGGCCGC GAACGATGCG TTCAAGGATC AGAAGGGCTT GAACTTCGGC GGCATGGAGA ACTGGGGCTC GATTCTCGAG TTCGCCGACG ACATCCTGCC CGAACCGGGC AAGCCGATGT CACGCTACGG CAACCAGGTG CTCACGCACG AAGTCGCGCA CCAATGGTTC GGCGATCTCG TGACGACCGA TTGGTGGGAC GACGTTTGGC TGAACGAATC GTTCGCGCGC TTCTTCGAAA CGAAGACGAC GATCCAGTTC TTCCCGGACG AGTTCAACTG GCTCGATCAC ATCAAGTCCA AGTATCGCGT GATCAACAAG GACATCAGCC AGGACGCGTT CCCGATTCAG CCGAACTTCA ACGGCTGGGC ATCGAACGAC TTCGTCATCA GCGCGAGTTC GTTCGTCTAT AACAAGGGCG GCATGGTGCT GAAGATGCTC GAGGGCTACC TCGGCGAGCA GACGCTGCGC AAGGGCCTGC AGCAATACCT GAACGACTAT GCGTTCGGAA ACGGCACGCC CAAGCGCCTG TGGGACGCGC TGTCCGCCGC GAGCGGCCAG CAGGTCGGCC CGATCGGCGA CAGCTTCGTG CGCCAGACGG GCGTGCCGCT TCTCGCGCTC GACACGCAAT GCGATCTGAC GAAGAACCAG AACGTCGTGA CGCTCACGCA GTCGCCGTTC CCGAACAAGA ACAAGTATCC GGGCGCGCAA TGGACGATCC CCGTCACGCT CGCGTACGGC GACGGCCTCG TCAACCGCAA GACGCTCGCG CTGAAGGATA CGCAGACGCA GATCCGCCTC GACGGCTGCT CGGCCGTGGT CGCGAATCCG ACCGGGTTCG ATTACTACGT GACGAACTAC AGCGACGCCG CGTGGAGCGC GCTGCTCACG CAGATCAATG CGTCGACGGA CCCGGTGCTG CTGCTGAACC TGAAGAGCGA GGCTGCGCTG CTCGTCGCGT CGAATCTCGC GCCGCCTTCC CGCGCCACGA GCATCTCGTC GATCAACTCG CCGGCCGCGA TGAAGCTGCG CCAGGTGCCG TCGATCCTCG AGACGCCGAA GGAACGCCCG CAACTGCGTT ACCAAGGCAA GTTCACGCCG CGGCAACAGC GGACGGAATA A
|
Protein sequence | MTAKPNYCPS NSCRTERFLF NSRFSMWINN RRLRYLSLLG ALALAACGGD DGGTGSAVSL GAAHSPSNGA NGANGTNGGA ATPAAVDKST KPVEMPDTVV PVNYKLWFRP NADLNQFSGR ADVEIKVLKP VNAIVVAGHR IQFTNGKTTL QPGNVQLVAT PQDKGDFYQL RPASGQIAPG NYSLHMEWQG IINFKSYDDP VNHTGGSCGN DPYPGCSAAE GIFRVDLKST DGTTSGAILT QGETNLSRQW FPGWDEPAFR PTYEVTAEVP QAWRVVSNAA ELPSVNVGGG YKLVSFEKTP PMPSYLLFFG GGLFDVLEDD FSSPLPNSNG LHLRIFTPPG MREWARPAMQ RTKQALDYYY RYTGIALPLK KFDTVAANDA FKDQKGLNFG GMENWGSILE FADDILPEPG KPMSRYGNQV LTHEVAHQWF GDLVTTDWWD DVWLNESFAR FFETKTTIQF FPDEFNWLDH IKSKYRVINK DISQDAFPIQ PNFNGWASND FVISASSFVY NKGGMVLKML EGYLGEQTLR KGLQQYLNDY AFGNGTPKRL WDALSAASGQ QVGPIGDSFV RQTGVPLLAL DTQCDLTKNQ NVVTLTQSPF PNKNKYPGAQ WTIPVTLAYG DGLVNRKTLA LKDTQTQIRL DGCSAVVANP TGFDYYVTNY SDAAWSALLT QINASTDPVL LLNLKSEAAL LVASNLAPPS RATSISSINS PAAMKLRQVP SILETPKERP QLRYQGKFTP RQQRTE
|
| |