Gene BURPS668_2922 details


Gene Information

Locus tag: BURPS668_2922
Symbol: pepN
ID: 4883469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2879357
End bp: 2882059
Gene Length: 2703 bp
Protein Length: 900 aa
Translation table: 11
GC content: 69%
IMG OID: 640128850
Product: aminopeptidase N
Protein accession: YP_001059940
Protein GI: 126442198
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID: [TIGR02414] aminopeptidase N, Escherichia coli type
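
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The record itself does not state either convention.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2879357, 2882059)
print(length)                     # 2703
print(protein_length_aa(length))  # 900
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.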


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGACA TCGCCGCTCC CAACGCCGAG ATCCGCCGCA GCGACTACAC GCCGCCTGCG 
TTCCTGATCG ATACCGTCTC GCTCGAGTTC GATCTCGAGC CGGCGCGCAC GATCGTCACG
AACACGATGC GCGTGCGCCG CAACCCGGAC GCCGCGCCCG CACCGCACTT CGAGCTGATG
GGCGAAGCGC TCGTGCTGAT CGGCGCGCGC GTCGACGGCA AGCCGCACGA CGCGGTGCGC
GTGCACGAAC ACGGCCTGAG CGTCGAGAAC GTGCCCGATG CGTTCGAGCT GACGATCGAG
AACGCATGCG CGCCCGAGTC GAACACGACG CTGTCGGGCC TGTACGTATC GAGCGGCAAC
TTCTTCACGC AGTGCGAGGC GGAGGGCTTT CGGCGCATCA CCTACTTCGT CGACCGTCCG
GACGTGATGG CGTCGTACAC GGTCACGCTG CGCGCCGACC AGGCCGCGTA CCCGGTGCTG
CTGTCGAACG GCAATCTCGT CGACGCCGGC GATCTGCCGA ACGGCCGTCA CTTCGCGAAG
TGGGAAGACC CGTTCAAGAA GCCGAGCTAC CTGTTCGCAC TCGTCGCGGG CAAACTCGTC
AAGCTCGAGG AAACGATCAA GTCGGCGAGC GGCAAGGACA AGCTCCTGCA GGTGTGGGTC
GAGCCGCAGG ATCTCGGCAA GACCCGCCAC GCGATGGATT CGCTGATCCA TTCGATCCGC
TGGGACGAAC GGCGCTTCGG CCTCGAGCTC GATCTCGACC GCTTCATGAT CGTCGCCGTC
GGCGATTTCA ACATGGGCGC GATGGAAAAC AAGGGGCTCA ACATCTTCAA CACGAAGTAC
GTGCTCGCGA ACCCGGAGAC GGCGACCGAC GTCGACTTCG CGAACGTCGA ATCGGTCGTC
GGCCACGAGT ATTTCCACAA CTGGACGGGC AACCGCGTGA CCTGCCGCGA CTGGTTCCAG
TTGAGCCTGA AGGAAGGCCT CACCGTGTTC CGCGACCAGG AGTTCTCGGC GGACATGTCA
GCGGGCGCCG AAGACGACGC CGCCGCGCGC GCGGTCAAGC GCATCGAGGA CGTGCGCGTG
CTGCGCCAGC TCCAGTTCGC CGAGGACGCG GGCCCGATGG CCCATCCGGT GCGGCCCGAG
CGTTACGTCG AGATCAACAA CTTCTACACG ATGACCGTCT ACGAGAAAGG CGCGGAAGTC
GTGCGGATGT ACCAGACGCT GTTCGGCCGC GACGGTTTCC GCAAGGGGAT GGACCTGTAC
TTCCGGCGCC ACGACGGGCA GGCCGTCACG TGCGACGACT TCCGCCACGC GATGGCCGAC
GCGAACGGCC GCGACCTCGC GCTGTTCGAG CGCTGGTACA GCCAGGCGGG CACGCCGCGC
GTGACGGTTC GCACCGCTTA CGACGCCGCC GCGAAGCGCT ACGCGGTGAC GCTGCGGCAA
GGCTACGGCG ACGCCGCGCC CGCCGCGCGC GACACGCAGA AAGGGCCGCT CCTGATCCCG
TTCGCGATCG GCCTGATCGG CGCCGACGGC CGCGATCTGC CGCTGCGCCT CGAAGGCGAA
GCGGCCGCGT CGGGCACGAC GCGCGTGCTC GAGCTGACCG AGGCCGAGGC GACGTTCACG
TTCGTCGATA TCGACGCGGC GCCGCTGCCG TCGCTGCTGC GCAATTTCTC CGCGCCCGTG
ATCGTCGAAT ACGACTACCG CGACGACGAG CTCGCGTTCC TGCTCGCGCA CGACAGCGAT
CCGTTCAACC GCTGGGAGGC GGGCCAGCGC CTCGCGACGC GCGCGCTGCT CACGCTCGCG
TCGCGTGCGG CGGCGCAGCA GCCGCTCACG CTCGACGACG CGTTCGCCGC CGCGTTCAAG
CGCGTGCTGA CGGACGACAC GCTGTCGCCC GCGTTCCGCG AGCTCGCGCT CACGTTGCCG
TCTGAGGCCT ACCTCGCCGA CCAGATGACG CAGGCCGATC CGGCCGCCGT CCATCGCGCA
CGCCAGTTCG TGCGCCGCCA GCTCGCGACG GCGCTGCGCG CCGAGTGGCT CTCGGTCTAC
GAGCGCCACC AGACGCCGGG CGCGTATGCG CCGACGCCCG GCGACGCGGG CCGCCGCGCG
CTGAAGAACC TCGCGCTCGC CTACCTCGCC GAACTCGACG AGCCGGCCGA CGCGATCCGG
CTCGCCACCG CGCAATACGA CGCCGCGAAC AACATGACCG ACCGCGCGTG CGCGCTCGTC
GCGCTGCTGT CGGCCGCCGC CGCGTCGGCC GACGCGGCGC GCGCCGCCGA TCGCGCGCTC
GACGATTTCT ATCGCCGCTT CGAGAACGAA GCGCTCGTGA TCGACAAGTG GTTCTCGATG
CAGGCGACGC GGCGCGGCAC GCCCGAGCAT CCGACGCTCG ACATCGTGCG CAAGCTGCTC
GCGCATCCGG CGTTCAACCT GAAGAACCCG AACCGCGCAC GCTCGCTGAT CTTCGGCTTC
TGCTCGGCGA ATCCCGCGCA GTTCCATGCG GCCGACGGCT CGGGCTATGC GTTCTGGGCC
GATCAGGTGC TCGCGCTCGA CGCGCTCAAT CCGCAGATCG CCGCGCGGCT TGCGCGCGCG
CTCGAGCTGT GGCGCCGCTT CACGCCGTCG CTGCGCGAGA AGATGCGCGA CGCGCTCGAG
CGCGTCGCCG CGAACGCGCA GTCGCGCGAC GTGCGGGAGA TCGTCGAGAA GGCGCTCGCC
TGA
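
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch of one way to do that and to compute a GC fraction comparable to the GC content field. All function names are hypothetical, and the start codons checked are only an illustrative subset of those permitted by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Illustrative subset of start codons; table 11 allows others as well.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, common start codon, terminal stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGTCCGACA TCGCCGCTCC CAACGCCGAG ATCCGCCGCA GCGACTACAC GCCGCCTGCG "
print(len(clean_sequence(first_line)))  # 60
print(gc_fraction(clean_sequence("GGCC AATT")))  # 0.5
```

Applying `clean_sequence` to the full listing and passing the result to `gc_fraction` and `looks_like_cds` would give a quick check against the GC content and Gene Length fields.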
 
Protein sequence
MSDIAAPNAE IRRSDYTPPA FLIDTVSLEF DLEPARTIVT NTMRVRRNPD AAPAPHFELM 
GEALVLIGAR VDGKPHDAVR VHEHGLSVEN VPDAFELTIE NACAPESNTT LSGLYVSSGN
FFTQCEAEGF RRITYFVDRP DVMASYTVTL RADQAAYPVL LSNGNLVDAG DLPNGRHFAK
WEDPFKKPSY LFALVAGKLV KLEETIKSAS GKDKLLQVWV EPQDLGKTRH AMDSLIHSIR
WDERRFGLEL DLDRFMIVAV GDFNMGAMEN KGLNIFNTKY VLANPETATD VDFANVESVV
GHEYFHNWTG NRVTCRDWFQ LSLKEGLTVF RDQEFSADMS AGAEDDAAAR AVKRIEDVRV
LRQLQFAEDA GPMAHPVRPE RYVEINNFYT MTVYEKGAEV VRMYQTLFGR DGFRKGMDLY
FRRHDGQAVT CDDFRHAMAD ANGRDLALFE RWYSQAGTPR VTVRTAYDAA AKRYAVTLRQ
GYGDAAPAAR DTQKGPLLIP FAIGLIGADG RDLPLRLEGE AAASGTTRVL ELTEAEATFT
FVDIDAAPLP SLLRNFSAPV IVEYDYRDDE LAFLLAHDSD PFNRWEAGQR LATRALLTLA
SRAAAQQPLT LDDAFAAAFK RVLTDDTLSP AFRELALTLP SEAYLADQMT QADPAAVHRA
RQFVRRQLAT ALRAEWLSVY ERHQTPGAYA PTPGDAGRRA LKNLALAYLA ELDEPADAIR
LATAQYDAAN NMTDRACALV ALLSAAAASA DAARAADRAL DDFYRRFENE ALVIDKWFSM
QATRRGTPEH PTLDIVRKLL AHPAFNLKNP NRARSLIFGF CSANPAQFHA ADGSGYAFWA
DQVLALDALN PQIAARLARA LELWRRFTPS LREKMRDALE RVAANAQSRD VREIVEKALA