Gene BURPS1106A_A0487 details


Gene Information

Locus tag: BURPS1106A_A0487
Symbol: phnS
ID: 4904629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 477213
End bp: 478295
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 66%
IMG OID: 640143593
Product: 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate-binding protein
Protein accession: YP_001074529
Protein GI: 126456762
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID: [TIGR03227] 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate binding protein
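
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon
    that is not counted as a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(477213, 478295)
print(length, protein_length(length))
```

Under those assumptions the computed values agree with the listed Gene Length (1083 bp) and Protein Length (360 aa).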


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTATA CGAATTTCCC GCGCGGCGGC GCCTGGCGCC GCTTCGCGCT CGCCGCCGGC 
GCCGCCGCGC TGTTGCAAGG CGCGGCAGCG CAGGCGCAGG CGGCAGCCGT CGTGCTGTAC
ACGGCGGACG GCCTCGAGAA CCTGTACCGC GACGTGCTGC CCGCGTTCGA GAAGCAGGAA
GGCGTGAAGG TGAACATCGT GACGGCGGGC AGCGGCGAAG TGGTGAACCG CGCGAACGTC
GAGAAGGGCT CGCCGAAGGC CGACGTGATC GTCACGCTGC CGCCGTTCAT TCAGCAGGCC
GGCCAGTTCG GCCTGCTGCA GCCGTACCGC AGCGTCAACT ACAAGAACGT GCCGGCGATC
GCGAAGGCGG AAGACGGCTC ATGGGCGACG TTCGTCAACA ACTACTTCTC GTTCGCGATC
AACCCGTCGG TCGTGAAGAG CCAGCCGAAG ACGTTCGCCG ATCTGCTGCA TCCCGATTAC
AGCGGCAAGC TCGCGTATTC GAACCCGGCG ACGGCGGGCG ACGGGATGGC CGTCATCATC
CTGACGAGCG CGCTGATGGG CGAGGACAAG GCGTTCGACT ATCTCGCGAA GCTCGAGCGC
AGCGTGAAGT TCCACACGAA GGGCACGGGC TACCTGAACG TGCTGCTGTC GCGCAACGAG
ATCGCGGTCG CGAACGGCGA TCTGCAGATG GATCTGGACG ACGCCGAGCA CGGCGGCCTG
TCGATCAAGC CGATCTTCGT CGCCGCGAAG GCGGGCGAGC CGCCGACGAC GTTCCAGTTG
CCGTACGCGA TCGGCCTCGT CAAGGGCGGC CCGAACCAGG ACGCGGGCAG GAAGCTGATC
GACTACCTGA TGTCGGCCGA CGTGCAGGCG AAGGTGCCCG ACATGTTCGG CATTCCGGGC
CGCACCGACG TGCCGCTTTC GGGCAAGAAC GGCGAGGCGG TGAAGCGCGC GATCGCCGGC
GTGAAGCTGA TTCCGGTCGA CTGGGACGCG GTGATGGCGA AGAAGCCCGT GTGGACCGAG
CGCTGGAAGA AGGAAGTGAT CGGCGATTCG GGCAAGCAGA CCGAAGTCGT CAAGCCGAAA
TGA
 
Protein sequence
MTYTNFPRGG AWRRFALAAG AAALLQGAAA QAQAAAVVLY TADGLENLYR DVLPAFEKQE 
GVKVNIVTAG SGEVVNRANV EKGSPKADVI VTLPPFIQQA GQFGLLQPYR SVNYKNVPAI
AKAEDGSWAT FVNNYFSFAI NPSVVKSQPK TFADLLHPDY SGKLAYSNPA TAGDGMAVII
LTSALMGEDK AFDYLAKLER SVKFHTKGTG YLNVLLSRNE IAVANGDLQM DLDDAEHGGL
SIKPIFVAAK AGEPPTTFQL PYAIGLVKGG PNQDAGRKLI DYLMSADVQA KVPDMFGIPG
RTDVPLSGKN GEAVKRAIAG VKLIPVDWDA VMAKKPVWTE RWKKEVIGDS GKQTEVVKPK