Gene BURPS1106A_A0757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0757 
Symbol 
ID4905496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp749460 
End bp750686 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content67% 
IMG OID640143863 
Productpeptidase family protein 
Protein accessionYP_001074793 
Protein GI126456186 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.31218 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAAGC TCCAACACCT CACCGCCGCG GTCTGCAGCG CCCTCTGCGT ATCGGCGGCC 
CATGCCGCGC CGGTGTGGAT CACGCTCTCC GAGCCCGCCC TGCGCGAGCT GCGCGCGCTC
GATCCCGCCG TGACGAGCCG TTACAGCGCG GCGCTCGCCA CCGGCGACGC GAAGCGCACC
GAAACGATCC ACGTCGCGCA GGTCGACGAT TCGCTGCTCG AATCGCTGTC GCAGGCGATC
CGCCGCGCGC GCGGCCACGG CCCGGGCTTT TTCGTGCACG CGACGTTCGA CGAAGCGCGC
GCGTCGCTGC AGCCGAGCGC GGCGAAGCAG GCGGCCGCGA TCGATTACCC GATCACCTAC
TCGCAACAGG TCCGCAACTG GATCTCGCAA CTGCAGGCGA GCAACATCGT CAGCACCATC
GTCTCGCTGT CCGGCTTCAC GAACCGCTAC TACACGACGA CGCACGGCGT GGCCGCGTCC
GACTGGATCG CGCAGCAATG GAAGCAGTTG GCCGGCTCGC GCACCGACGT GACGGTCGAG
CAGTTCACGC ATGCCGGCTG GCCGCAGAAA TCGGTGGTCC TGACGATCAA GGGCAGCGAT
CCGGCCGCGG GCGTCGTCGT GATCGGCGGC CATCTCGATT CGACCGTCGG CCGCATGAGC
GAGAACACGC GCGCGCCCGG CGCGGACGAC GACGCATCCG GCATCGCAAG CCTCACCGAG
GCGCTGCGCG TGCTGCTCGC GAACCGCTAC CAGCCGAAGC GCACGCTCAA GTTCATCGGC
TACGCGGCGG AAGAGGCGGG CCTTCTCGGC TCGCAGGCGA TCGCGAAGCA GTTCAGGGCG
CAGAACGTGA ACGTCGTCGG CGCGTTCCAG CTCGACATGA CGAACTACAA GGGAGATCCG
AAGGATATCT ATCTGATCAG CGACTACACG AACGCGACAC AGAACACGTA CCTCGCGAAC
CTCGCGAAAG CGTATCTGCC CGAGCTCGCG GTCGGCACGT CGCAATGCGG CTATGCGTGC
TCCGATCACG CGTCGTGGAA CGCGCAGGGC TATCCGGCGT CGTTCCCGTT CGAAGCGGAT
CAGAACGACA ATCCGTACAT CCATTCCGCG TATGACACGC TCGAGCGGTC GGACTCGCAA
GGCAACCACG CGCTGAAGTT CAGCAAGCTC GCGCTCGCGT ACGCGGCGGA GCTGGGCGGC
GGGCTGAGCG CGTCCGCGAA GCGGTAA
 
Protein sequence
MSKLQHLTAA VCSALCVSAA HAAPVWITLS EPALRELRAL DPAVTSRYSA ALATGDAKRT 
ETIHVAQVDD SLLESLSQAI RRARGHGPGF FVHATFDEAR ASLQPSAAKQ AAAIDYPITY
SQQVRNWISQ LQASNIVSTI VSLSGFTNRY YTTTHGVAAS DWIAQQWKQL AGSRTDVTVE
QFTHAGWPQK SVVLTIKGSD PAAGVVVIGG HLDSTVGRMS ENTRAPGADD DASGIASLTE
ALRVLLANRY QPKRTLKFIG YAAEEAGLLG SQAIAKQFRA QNVNVVGAFQ LDMTNYKGDP
KDIYLISDYT NATQNTYLAN LAKAYLPELA VGTSQCGYAC SDHASWNAQG YPASFPFEAD
QNDNPYIHSA YDTLERSDSQ GNHALKFSKL ALAYAAELGG GLSASAKR