Gene BURPS1710b_3028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3028 
SymbolpepN 
ID3690153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3336729 
End bp3339602 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content69% 
IMG OID637729483 
Productaminopeptidase N 
Protein accessionYP_334406 
Protein GI76808653 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGAT CATACGCCGT CGCGCAAACG CCGCGCCGAT ATGGAGAAGA CCGTCGGCGC 
CGCCGAAAAG TTCGTCGCGC GCGCCTTGCG CAAAGGCTTT GTACAATGGC CTTTTGCCGC
CGCGCCGCCG CAGCGGCGCG TATTCACACT TTTCCAACGC CCACGATCGC CATGTCCGAC
ATCGCCGCTC CCAACGCCGA GATCCGCCGC AGCGACTACA CGCCGCCTGC GTTCCTGATC
GATACCGTCT CGCTCGAGTT CGATCTCGAG CCGGCGCGCA CGATCGTCAC GAACACGATG
CGCGTGCGCC GCAACCCGGA CGCCGCGCCC GCACCGCACT TCGAGCTGAT GGGCGAAGCG
CTCGTGCTGA TCGGCGCGCG CGTCGACGGC AAGCCGCACG ACGCGGTGCG CGTGCACGAA
CACGGCCTGA GCGTCGAGAA CGTGCCCGAT GCGTTCGAGC TGACGATCGA GAACGCATGC
GCGCCCGAGT CGAACACGAC GCTGTCGGGC CTGTACGTAT CGAGCGGCAA CTTCTTCACG
CAGTGCGAGG CGGAGGGCTT TCGGCGCATC ACCTACTTCG TCGACCGTCC GGACGTGATG
GCGTCGTACA CGGTCACGCT GCGCGCCGAC CAGGCCGCGT ACCCGGTGCT GCTGTCGAAC
GGCAATCTCG TCGACGCCGG CGATCTGCCG AACGGCCGTC ACTTCGCGAA GTGGGAAGAC
CCGTTCAAGA AGCCGAGCTA CCTGTTCGCA CTCGTCGCGG GCAAACTCGT CAAGCTCGAG
GAAACGATCA AGTCGGCGAG CGGCAAGGAC AAGCTCCTGC AGGTGTGGGT CGAGCCGCAG
GATCTCGGCA AGACCCGCCA CGCGATGGAT TCGCTGATCC ATTCGATCCG CTGGGACGAA
CGGCGCTTCG GCCTCGAGCT CGATCTCGAC CGCTTCATGA TCGTCGCCGT CGGCGATTTC
AACATGGGCG CGATGGAAAA CAAGGGGCTC AACATCTTCA ACACGAAGTA CGTGCTCGCG
AACCCGGAGA CGGCGACCGA CGTCGACTTC GCGAACGTCG AATCGGTCGT CGGCCACGAG
TATTTCCACA ACTGGACGGG CAACCGCGTG ACCTGCCGCG ACTGGTTCCA GTTGAGCCTG
AAGGAAGGCC TCACCGTGTT CCGCGACCAG GAGTTCTCGG CGGACATGTC CGCGGGCGCC
GAAGACGACG CCGCCGCGCG CGCGGTCAAG CGCATCGAGG ACGTGCGCGT GCTGCGCCAG
CTCCAGTTCG CCGAGGACGC GGGCCCGATG GCCCATCCGG TGCGGCCCGA GCGTTACGTC
GAGATCAACA ACTTCTACAC GATGACCGTC TACGAGAAAG GCGCGGAAGT CGTGCGGATG
TACCAGACGC TGTTCGGCCG CGACGGTTTC CGCAAGGGGA TGGACCTGTA CTTCCGGCGC
CACGACGGGC AGGCCGTCAC GTGCGACGAC TTCCGCCACG CGATGGCCGA CGCGAACGGC
CGCGACCTCG CGCTGTTCGA GCGCTGGTAC AGCCAGGCGG GCACGCCGCG CGTGACGGTT
CGCACCGCTT ACGACGCCGC CGCGAAGCGC TACGCGGTGA CGCTGCGGCA AGGCTACGGC
GACGCCGCGC CCGCCGCGCG CGACACGCAG AAAGGGCCGC TCCTGATCCC GTTCGCGATC
GGCCTGATCG GCGCCGACGG CCGCGATCTG CCGCTGCGCC TCGAAGGCGA AGCGGCCGCG
TCGGGCACGA CGCGCGTGCT CGAGCTGACC GAGGCCGAGG CGACGTTCAC GTTCGTCGAC
ATCGACGCGG CGCCGCTGCC GTCGCTGCTG CGCAATTTCT CCGCGCCCGT GATCGTCGAA
TACGACTACC GCGACGACGA GCTCGCGTTC CTGCTCGCGC ACGACAGCGA TCCGTTCAAC
CGCTGGGAGG CGGGCCAGCG CCTCGCGACG CGCGCGCTGC TCACGCTCGC GTCGCGTGCG
GCGGCGCAGC AGCCGCTCAC GCTCGACGAC GCGTTCGCCG CCGCGTTCAA GCGCGTGCTG
ACGGACGACA CGCTGTCGCC CGCGTTCCGC GAGCTCGCGC TCACGTTGCC GTCGGAGGCC
TACCTCGCCG ACCAGATGAC GCAGGCCGAT CCGGCCGCCG TCCATCGCGC GCGCCAGTTC
GTGCGCCGCC AGCTCGCGAC GGCGCTGCGC GCCGAGTGGC TCTCGGTCTA CGAGCGCCAC
CAGACGCCGG GCGCGTATGC GCCGACGCCC GGCGACGCGG GCCGCCGCGC GCTGAAGAAC
CTCGCGCTCG CCTACCTCGC CGAACTCGAC GAGCCGGCCG ACGCGATCCG GCTCGCCACC
GCGCAATACG ACGCCGCGAA CAACATGACC GACCGCGCGT GCGCGCTCGT CGCGCTGCTG
TCGGCCGCCG CCGCGTCGGC CGACGCGGCG CGCGCCGCCG ATCGCGCGCT CGACGATTTC
TATCGCCGCT TCGAGAACGA AGCGCTCGTG ATCGACAAGT GGTTCTCGAT GCAGGCGACG
CGGCGCGGCA CGCCCGAGCA TCCGACGCTC GACATCGTGC GCAAGCTGCT CGCGCATCCG
GCGTTCAACC TGAAGAACCC GAACCGCGCA CGCTCGCTGA TCTTCGGCTT CTGCTCGGCG
AATCCCGCGC AGTTCCATGC GGCCGACGGC TCGGGCTATG CGTTCTGGGC CGATCAGGTG
CTCGCGCTCG ACGCGCTTAA TCCGCAGATC GCCGCGCGGC TTGCGCGCGC GCTCGAGCTG
TGGCGCCGCT TCACGCCGTC GCTGCGCGAG AAGATGCGCG ACGCGCTCGA GCGCGTCGCC
GCGAACGCGC AGTCGCGCGA CGTGCGGGAG ATCGTCGAGA AGGCGCTCGC CTGA
 
Protein sequence
MTGSYAVAQT PRRYGEDRRR RRKVRRARLA QRLCTMAFCR RAAAAARIHT FPTPTIAMSD 
IAAPNAEIRR SDYTPPAFLI DTVSLEFDLE PARTIVTNTM RVRRNPDAAP APHFELMGEA
LVLIGARVDG KPHDAVRVHE HGLSVENVPD AFELTIENAC APESNTTLSG LYVSSGNFFT
QCEAEGFRRI TYFVDRPDVM ASYTVTLRAD QAAYPVLLSN GNLVDAGDLP NGRHFAKWED
PFKKPSYLFA LVAGKLVKLE ETIKSASGKD KLLQVWVEPQ DLGKTRHAMD SLIHSIRWDE
RRFGLELDLD RFMIVAVGDF NMGAMENKGL NIFNTKYVLA NPETATDVDF ANVESVVGHE
YFHNWTGNRV TCRDWFQLSL KEGLTVFRDQ EFSADMSAGA EDDAAARAVK RIEDVRVLRQ
LQFAEDAGPM AHPVRPERYV EINNFYTMTV YEKGAEVVRM YQTLFGRDGF RKGMDLYFRR
HDGQAVTCDD FRHAMADANG RDLALFERWY SQAGTPRVTV RTAYDAAAKR YAVTLRQGYG
DAAPAARDTQ KGPLLIPFAI GLIGADGRDL PLRLEGEAAA SGTTRVLELT EAEATFTFVD
IDAAPLPSLL RNFSAPVIVE YDYRDDELAF LLAHDSDPFN RWEAGQRLAT RALLTLASRA
AAQQPLTLDD AFAAAFKRVL TDDTLSPAFR ELALTLPSEA YLADQMTQAD PAAVHRARQF
VRRQLATALR AEWLSVYERH QTPGAYAPTP GDAGRRALKN LALAYLAELD EPADAIRLAT
AQYDAANNMT DRACALVALL SAAAASADAA RAADRALDDF YRRFENEALV IDKWFSMQAT
RRGTPEHPTL DIVRKLLAHP AFNLKNPNRA RSLIFGFCSA NPAQFHAADG SGYAFWADQV
LALDALNPQI AARLARALEL WRRFTPSLRE KMRDALERVA ANAQSRDVRE IVEKALA