Gene BURPS668_3067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3067 
Symbol 
ID4883823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3008433 
End bp3009740 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content64% 
IMG OID640128995 
Productputative amino acid ABC transporter, periplasmic amino acid-binding protein 
Protein accessionYP_001060079 
Protein GI126438928 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.656729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGTC GCAGTCTTTT GAAGTTCGGA TCGATGGCAG GCGTCATGGC GCTCGCGGGG 
CAAAGCCCCG TCGCGCGCGC GGCGGATTCG GGCAAAGGCC CGATCAAGGT CGGCATCCTG
CATTCGCTGT CGGGCACGAT GGCGATCTCC GAGACTTCGC TCAAGGACAC CGCGCTGATG
ACGATCGCCG ACATCAACAA GAACGGCGGC GTGCTCGGGC GGCCGCTGCA GCCCGTCGTC
GTCGATCCCG CGTCGAACTG GCCGCTGTTC GCCGAGAAGG CGCGCCAGTT GCTCACGCAG
GAGAAGGTCG CATGCGTGTT CGGCTGCTGG ACGTCGGTGT CGCGCAAGTC GGTGCTGCCC
GTGTTCGAGG AGCTGAACGG CCTGCTCTAC TACCCGGTGC AGTACGAGGG CGAGGAGATG
TCGCGCAACG TGTTCTACAC GGGCGCCGCG CCGAACCAGC AGGCGATTCC GGCCGTCGAG
TACATGATGA GCGCCGAAGG CGGCGGCGCG AAGCGCTTCT TCCTGCTCGG CACCGATTAC
GTCTACCCGC GCACGACCAA CAAGATCCTG CGCGCGTTCC TGAAATCGAA GGGCGTGAAA
GATTCCGATA TTCAGGAAGT CTACACACCG TTCGGGCACA GCGATTACCA GACGATCGTC
GCGAACATCA AGACCTTCGC GCAAGGCGGC AAGACCACCG TGATCTCGAC GATCAACGGC
GATTCGAACG TGCCGTTCTA CAAGGAGCTC GGCAATCAGG GGCTCAAGGC GACCGACGTG
CCCGTCGTCG CGTTCTCGGT CGGCGAGGAG GAACTGCGCG GCATCGACAC GAAGCCGCTC
GTCGGGCATC TGGCCGCGTG GAATTACTTC ATGTCGGTGA AGGGGCCGGC GAACGCGAAG
TTCAAGGAGC AGTTCGCCGC GTGGGTGAAG TCGCAGAATC TGCCGGGCGG CGCGAAGCGC
GTGACCAACG ATCCGATGGA GGCGACGTTC GTCGGCATCC ACATGTGGAA GCAGGCGGTC
GAGAAGGCGA AGAGCACCGA TGTCGACCGC GTGCGCACGG CGATGATCGG CCAGAGCGTC
GCCGCGCCGT CGGGCTTCAC ACTGACGATG GACGGCAACC ATCATCTGCA CAAGCCGGTG
ATGATCGGCG AGATTCGCGG CGACGGCCAG TTCAACGTCG TCTGGAAAAC GAAGACGGCG
ATTCGCGCGC AGCCGTGGAG CCCGTTCATC GCGGGCAACC AGGGCAAGCC GGACGTGGTC
GGCTCGATTC CGGAGTTCCT GCGCCGCCGT CGCGCCGCGC TCGCCTGA
 
Protein sequence
MKRRSLLKFG SMAGVMALAG QSPVARAADS GKGPIKVGIL HSLSGTMAIS ETSLKDTALM 
TIADINKNGG VLGRPLQPVV VDPASNWPLF AEKARQLLTQ EKVACVFGCW TSVSRKSVLP
VFEELNGLLY YPVQYEGEEM SRNVFYTGAA PNQQAIPAVE YMMSAEGGGA KRFFLLGTDY
VYPRTTNKIL RAFLKSKGVK DSDIQEVYTP FGHSDYQTIV ANIKTFAQGG KTTVISTING
DSNVPFYKEL GNQGLKATDV PVVAFSVGEE ELRGIDTKPL VGHLAAWNYF MSVKGPANAK
FKEQFAAWVK SQNLPGGAKR VTNDPMEATF VGIHMWKQAV EKAKSTDVDR VRTAMIGQSV
AAPSGFTLTM DGNHHLHKPV MIGEIRGDGQ FNVVWKTKTA IRAQPWSPFI AGNQGKPDVV
GSIPEFLRRR RAALA