Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_3028 |
Symbol | pepN |
ID | 3690153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 3336729 |
End bp | 3339602 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637729483 |
Product | aminopeptidase N |
Protein accession | YP_334406 |
Protein GI | 76808653 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGAT CATACGCCGT CGCGCAAACG CCGCGCCGAT ATGGAGAAGA CCGTCGGCGC CGCCGAAAAG TTCGTCGCGC GCGCCTTGCG CAAAGGCTTT GTACAATGGC CTTTTGCCGC CGCGCCGCCG CAGCGGCGCG TATTCACACT TTTCCAACGC CCACGATCGC CATGTCCGAC ATCGCCGCTC CCAACGCCGA GATCCGCCGC AGCGACTACA CGCCGCCTGC GTTCCTGATC GATACCGTCT CGCTCGAGTT CGATCTCGAG CCGGCGCGCA CGATCGTCAC GAACACGATG CGCGTGCGCC GCAACCCGGA CGCCGCGCCC GCACCGCACT TCGAGCTGAT GGGCGAAGCG CTCGTGCTGA TCGGCGCGCG CGTCGACGGC AAGCCGCACG ACGCGGTGCG CGTGCACGAA CACGGCCTGA GCGTCGAGAA CGTGCCCGAT GCGTTCGAGC TGACGATCGA GAACGCATGC GCGCCCGAGT CGAACACGAC GCTGTCGGGC CTGTACGTAT CGAGCGGCAA CTTCTTCACG CAGTGCGAGG CGGAGGGCTT TCGGCGCATC ACCTACTTCG TCGACCGTCC GGACGTGATG GCGTCGTACA CGGTCACGCT GCGCGCCGAC CAGGCCGCGT ACCCGGTGCT GCTGTCGAAC GGCAATCTCG TCGACGCCGG CGATCTGCCG AACGGCCGTC ACTTCGCGAA GTGGGAAGAC CCGTTCAAGA AGCCGAGCTA CCTGTTCGCA CTCGTCGCGG GCAAACTCGT CAAGCTCGAG GAAACGATCA AGTCGGCGAG CGGCAAGGAC AAGCTCCTGC AGGTGTGGGT CGAGCCGCAG GATCTCGGCA AGACCCGCCA CGCGATGGAT TCGCTGATCC ATTCGATCCG CTGGGACGAA CGGCGCTTCG GCCTCGAGCT CGATCTCGAC CGCTTCATGA TCGTCGCCGT CGGCGATTTC AACATGGGCG CGATGGAAAA CAAGGGGCTC AACATCTTCA ACACGAAGTA CGTGCTCGCG AACCCGGAGA CGGCGACCGA CGTCGACTTC GCGAACGTCG AATCGGTCGT CGGCCACGAG TATTTCCACA ACTGGACGGG CAACCGCGTG ACCTGCCGCG ACTGGTTCCA GTTGAGCCTG AAGGAAGGCC TCACCGTGTT CCGCGACCAG GAGTTCTCGG CGGACATGTC CGCGGGCGCC GAAGACGACG CCGCCGCGCG CGCGGTCAAG CGCATCGAGG ACGTGCGCGT GCTGCGCCAG CTCCAGTTCG CCGAGGACGC GGGCCCGATG GCCCATCCGG TGCGGCCCGA GCGTTACGTC GAGATCAACA ACTTCTACAC GATGACCGTC TACGAGAAAG GCGCGGAAGT CGTGCGGATG TACCAGACGC TGTTCGGCCG CGACGGTTTC CGCAAGGGGA TGGACCTGTA CTTCCGGCGC CACGACGGGC AGGCCGTCAC GTGCGACGAC TTCCGCCACG CGATGGCCGA CGCGAACGGC CGCGACCTCG CGCTGTTCGA GCGCTGGTAC AGCCAGGCGG GCACGCCGCG CGTGACGGTT CGCACCGCTT ACGACGCCGC CGCGAAGCGC TACGCGGTGA CGCTGCGGCA AGGCTACGGC GACGCCGCGC CCGCCGCGCG CGACACGCAG AAAGGGCCGC TCCTGATCCC GTTCGCGATC GGCCTGATCG GCGCCGACGG CCGCGATCTG CCGCTGCGCC TCGAAGGCGA AGCGGCCGCG TCGGGCACGA CGCGCGTGCT CGAGCTGACC GAGGCCGAGG CGACGTTCAC GTTCGTCGAC ATCGACGCGG CGCCGCTGCC GTCGCTGCTG CGCAATTTCT CCGCGCCCGT GATCGTCGAA TACGACTACC GCGACGACGA GCTCGCGTTC CTGCTCGCGC ACGACAGCGA TCCGTTCAAC CGCTGGGAGG CGGGCCAGCG CCTCGCGACG CGCGCGCTGC TCACGCTCGC GTCGCGTGCG GCGGCGCAGC AGCCGCTCAC GCTCGACGAC GCGTTCGCCG CCGCGTTCAA GCGCGTGCTG ACGGACGACA CGCTGTCGCC CGCGTTCCGC GAGCTCGCGC TCACGTTGCC GTCGGAGGCC TACCTCGCCG ACCAGATGAC GCAGGCCGAT CCGGCCGCCG TCCATCGCGC GCGCCAGTTC GTGCGCCGCC AGCTCGCGAC GGCGCTGCGC GCCGAGTGGC TCTCGGTCTA CGAGCGCCAC CAGACGCCGG GCGCGTATGC GCCGACGCCC GGCGACGCGG GCCGCCGCGC GCTGAAGAAC CTCGCGCTCG CCTACCTCGC CGAACTCGAC GAGCCGGCCG ACGCGATCCG GCTCGCCACC GCGCAATACG ACGCCGCGAA CAACATGACC GACCGCGCGT GCGCGCTCGT CGCGCTGCTG TCGGCCGCCG CCGCGTCGGC CGACGCGGCG CGCGCCGCCG ATCGCGCGCT CGACGATTTC TATCGCCGCT TCGAGAACGA AGCGCTCGTG ATCGACAAGT GGTTCTCGAT GCAGGCGACG CGGCGCGGCA CGCCCGAGCA TCCGACGCTC GACATCGTGC GCAAGCTGCT CGCGCATCCG GCGTTCAACC TGAAGAACCC GAACCGCGCA CGCTCGCTGA TCTTCGGCTT CTGCTCGGCG AATCCCGCGC AGTTCCATGC GGCCGACGGC TCGGGCTATG CGTTCTGGGC CGATCAGGTG CTCGCGCTCG ACGCGCTTAA TCCGCAGATC GCCGCGCGGC TTGCGCGCGC GCTCGAGCTG TGGCGCCGCT TCACGCCGTC GCTGCGCGAG AAGATGCGCG ACGCGCTCGA GCGCGTCGCC GCGAACGCGC AGTCGCGCGA CGTGCGGGAG ATCGTCGAGA AGGCGCTCGC CTGA
|
Protein sequence | MTGSYAVAQT PRRYGEDRRR RRKVRRARLA QRLCTMAFCR RAAAAARIHT FPTPTIAMSD IAAPNAEIRR SDYTPPAFLI DTVSLEFDLE PARTIVTNTM RVRRNPDAAP APHFELMGEA LVLIGARVDG KPHDAVRVHE HGLSVENVPD AFELTIENAC APESNTTLSG LYVSSGNFFT QCEAEGFRRI TYFVDRPDVM ASYTVTLRAD QAAYPVLLSN GNLVDAGDLP NGRHFAKWED PFKKPSYLFA LVAGKLVKLE ETIKSASGKD KLLQVWVEPQ DLGKTRHAMD SLIHSIRWDE RRFGLELDLD RFMIVAVGDF NMGAMENKGL NIFNTKYVLA NPETATDVDF ANVESVVGHE YFHNWTGNRV TCRDWFQLSL KEGLTVFRDQ EFSADMSAGA EDDAAARAVK RIEDVRVLRQ LQFAEDAGPM AHPVRPERYV EINNFYTMTV YEKGAEVVRM YQTLFGRDGF RKGMDLYFRR HDGQAVTCDD FRHAMADANG RDLALFERWY SQAGTPRVTV RTAYDAAAKR YAVTLRQGYG DAAPAARDTQ KGPLLIPFAI GLIGADGRDL PLRLEGEAAA SGTTRVLELT EAEATFTFVD IDAAPLPSLL RNFSAPVIVE YDYRDDELAF LLAHDSDPFN RWEAGQRLAT RALLTLASRA AAQQPLTLDD AFAAAFKRVL TDDTLSPAFR ELALTLPSEA YLADQMTQAD PAAVHRARQF VRRQLATALR AEWLSVYERH QTPGAYAPTP GDAGRRALKN LALAYLAELD EPADAIRLAT AQYDAANNMT DRACALVALL SAAAASADAA RAADRALDDF YRRFENEALV IDKWFSMQAT RRGTPEHPTL DIVRKLLAHP AFNLKNPNRA RSLIFGFCSA NPAQFHAADG SGYAFWADQV LALDALNPQI AARLARALEL WRRFTPSLRE KMRDALERVA ANAQSRDVRE IVEKALA
|
| |