Gene BURPS1710b_3622 details

Gene Information

Locus tag: BURPS1710b_3622
Symbol: pepN
ID: 3689958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 3943391
End bp: 3945631
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 64%
IMG OID: 637730077
Product: peptidase
Protein accession: YP_334987
Protein GI: 76811949
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID:
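
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3943391
end_bp = 3945631

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be a stop.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # prints: 2241 746
```

Both results agree with the Gene Length and Protein Length fields of the record.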


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.178496
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGACGGCGA AGCCGAATTA TTGCCCGTCG AATTCATGCA GGACCGAACG ATTTCTTTTC 
AATTCGAGGT TCAGTATGTG GATAAACAAT CGTAGATTGC GATATCTGTC CCTGCTCGGC
GCGCTCGCGC TCGCCGCGTG CGGCGGCGAC GACGGCGGCA CCGGCTCGGC CGTGTCGCTC
GGCGCCGCGC ACAGCCCGTC GAACGGCGCA AACGGCGCAA ACGGCACGAA CGGCGGCGCG
GCCACGCCCG CCGCCGTCGA CAAAAGCACG AAGCCCGTCG AAATGCCCGA CACCGTCGTG
CCCGTCAACT ACAAGCTCTG GTTCCGCCCG AACGCGGACC TGAATCAATT CAGCGGCCGC
GCGGACGTCG AGATCAAGGT GCTCAAGCCC GTCAACGCGA TCGTCGTCGC GGGCCACCGG
ATCCAGTTCA CGAACGGCAA GACCACGCTG CAGCCCGGCA ACGTGCAACT GGTCGCGACG
CCGCAGGACA AGGGCGATTT CTATCAACTG CGTCCCGCGA GCGGCCAGAT CGCCCCGGGC
AACTACTCGC TGCACATGGA GTGGCAGGGG ATCATCAACT TCAAGTCGTA CGACGACCCG
GTCAATCACA CGGGCGGCAG CTGCGGCAAC GATCCATATC CGGGCTGCTC GGCGGCCGAG
GGGATCTTCC GCGTCGACCT GAAGAGCACC GACGGCACGA CGAGCGGCGC GATCCTCACG
CAAGGCGAGA CGAACCTGTC GCGCCAATGG TTCCCGGGCT GGGACGAGCC CGCGTTCCGT
CCGACCTATG AAGTGACGGC CGAAGTGCCG CAGGCATGGC GCGTCGTGTC GAACGCGGCC
GAACTGCCGT CGGTAAACGT CGGCGGCGGC TACAAGCTGG TGTCGTTCGA GAAGACGCCG
CCGATGCCGT CGTATCTGCT GTTCTTCGGC GGCGGCCTGT TCGACGTGCT CGAAGACGAC
TTCTCGAGCC CGCTGCCGAA CAGCAACGGC CTGCACCTGC GCATCTTCAC GCCGCCCGGC
ATGCGCGAAT GGGCACGCCC CGCGATGCAA CGCACGAAGC AGGCGCTCGA TTACTACTAT
CGCTACACCG GCATTGCGCT GCCGCTCAAG AAGTTCGACA CGGTGGCCGC GAACGATGCG
TTCAAGGATC AGAAGGGCTT GAACTTCGGC GGCATGGAGA ACTGGGGCTC GATTCTCGAG
TTCGCCGACG ACATCCTGCC CGAACCGGGC AAGCCGATGT CACGCTACGG CAACCAGGTG
CTCACGCACG AAGTCGCGCA CCAATGGTTC GGCGATCTCG TGACGACCGA TTGGTGGGAC
GACGTTTGGC TGAACGAATC GTTCGCGCGC TTCTTCGAAA CGAAGACGAC GATCCAGTTC
TTCCCGGACG AGTTCAACTG GCTCGATCAC ATCAAGTCCA AGTATCGCGT GATCAACAAG
GACATCAGCC AGGACGCGTT CCCGATTCAG CCGAACTTCA ACGGCTGGGC ATCGAACGAC
TTCGTCATCA GCGCGAGTTC GTTCGTCTAT AACAAGGGCG GCATGGTGCT GAAGATGCTC
GAGGGCTACC TCGGCGAGCA GACGCTGCGC AAGGGCCTGC AGCAATACCT GAACGACTAT
GCGTTCGGAA ACGGCACGCC CAAGCGCCTG TGGGACGCGC TGTCCGCCGC GAGCGGCCAG
CAGGTCGGCC CGATCGGCGA CAGCTTCGTG CGCCAGACGG GCGTGCCGCT TCTCGCGCTC
GACACGCAAT GCGATCTGAC GAAGAACCAG AACGTCGTGA CGCTCACGCA GTCGCCGTTC
CCGAACAAGA ACAAGTATCC GGGCGCGCAA TGGACGATCC CCGTCACGCT CGCGTACGGC
GACGGCCTCG TCAACCGCAA GACGCTCGCG CTGAAGGATA CGCAGACGCA GATCCGCCTC
GACGGCTGCT CGGCCGTGGT CGCGAATCCG ACCGGGTTCG ATTACTACGT GACGAACTAC
AGCGACGCCG CGTGGAGCGC GCTGCTCACG CAGATCAATG CGTCGACGGA CCCGGTGCTG
CTGCTGAACC TGAAGAGCGA GGCTGCGCTG CTCGTCGCGT CGAATCTCGC GCCGCCTTCC
CGCGCCACGA GCATCTCGTC GATCAACTCG CCGGCCGCGA TGAAGCTGCG CCAGGTGCCG
TCGATCCTCG AGACGCCGAA GGAACGCCCG CAACTGCGTT ACCAAGGCAA GTTCACGCCG
CGGCAACAGC GGACGGAATA A
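
The GC content field of the record is presumably the fraction of G and C bases in the gene sequence above. The sketch below shows one way such a value could be computed from a sequence printed in blocks of ten; the function name `gc_content` is hypothetical, and the demonstration uses only the first ten bases of the gene rather than the full sequence.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First ten-base block of the gene sequence above (illustration only).
first_block = "TTGACGGCGA"
print(f"{gc_content(first_block):.0%}")  # prints: 60%
```

Passing the whole gene sequence, with its spaces and line breaks left in, would give the gene-wide figure to compare against the GC content field.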
 
Protein sequence
MTAKPNYCPS NSCRTERFLF NSRFSMWINN RRLRYLSLLG ALALAACGGD DGGTGSAVSL 
GAAHSPSNGA NGANGTNGGA ATPAAVDKST KPVEMPDTVV PVNYKLWFRP NADLNQFSGR
ADVEIKVLKP VNAIVVAGHR IQFTNGKTTL QPGNVQLVAT PQDKGDFYQL RPASGQIAPG
NYSLHMEWQG IINFKSYDDP VNHTGGSCGN DPYPGCSAAE GIFRVDLKST DGTTSGAILT
QGETNLSRQW FPGWDEPAFR PTYEVTAEVP QAWRVVSNAA ELPSVNVGGG YKLVSFEKTP
PMPSYLLFFG GGLFDVLEDD FSSPLPNSNG LHLRIFTPPG MREWARPAMQ RTKQALDYYY
RYTGIALPLK KFDTVAANDA FKDQKGLNFG GMENWGSILE FADDILPEPG KPMSRYGNQV
LTHEVAHQWF GDLVTTDWWD DVWLNESFAR FFETKTTIQF FPDEFNWLDH IKSKYRVINK
DISQDAFPIQ PNFNGWASND FVISASSFVY NKGGMVLKML EGYLGEQTLR KGLQQYLNDY
AFGNGTPKRL WDALSAASGQ QVGPIGDSFV RQTGVPLLAL DTQCDLTKNQ NVVTLTQSPF
PNKNKYPGAQ WTIPVTLAYG DGLVNRKTLA LKDTQTQIRL DGCSAVVANP TGFDYYVTNY
SDAAWSALLT QINASTDPVL LLNLKSEAAL LVASNLAPPS RATSISSINS PAAMKLRQVP
SILETPKERP QLRYQGKFTP RQQRTE
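
The gene sequence begins with TTG while the protein sequence begins with M. That is consistent with translation table 11, where TTG is one of the permitted initiation codons and an initiation codon is read as methionine. The following is a minimal sketch of that behaviour, assuming the standard amino acid assignments and the table 11 start codon set from the NCBI genetic code tables; the function name `translate` and the constants are illustrative, not taken from the record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons of translation table 11 (assumption: taken from the NCBI tables).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, reading a valid first codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is translated as methionine
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGACGGCGA AGCCGAATTA TTGCCCGTCG"))  # prints: MTAKPNYCPS
```

The output matches the first ten residues of the protein sequence listed in the record.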