Gene BURPS1106A_4065 details

Gene Information

Locus tag: BURPS1106A_4065
Symbol:
ID: 4899966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3970562
End bp: 3971758
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 64%
IMG OID: 640137291
Product: putative amino acid uptake ABC transporter, periplasmic amino acid-binding protein
Protein accession: YP_001068284
Protein GI: 126454254
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
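
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in this record, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3970562, 3971758)
print(length)                  # 1197
print(protein_length(length))  # 398
```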


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTGA AGACCCTTGC GCACGCCTGT CTTGCCGTCG CCGCCGCGTG GTCGGTGGGC 
GCCGCGCAGG CCGCGGATTC CGTGAAGATC GGCTTCATCA CCGACATGTC CGGCCTCTAT
GCCGACATCG ACGGGCAGGG CGGCCTCGAG GCGATCAAGA TGGCGGTGGC CGACTTCGGC
GGCAAGGTCA ACGGCAAGCC GATCGAGGTC GTGTATGCGG ACCACCAGAA CAAGGCGGAC
ATCGCCGCAT CGAAGGCGCG CGAATGGATG GACCGCGGCG GGCTCGACCT GCTCGTCGGC
GGCACGAACT CGGCGACCGC GCTGTCGATG AACCAGGTCG CGGCCGAGAA GAAGAAGGTC
TACATCAACA TCGGCGCGGG CGCGGACACG CTGACGAACG AGCAGTGCAC GCCGTACACG
GTCCACTACG CGTACGACAC GATGGCGCTC GCGAAGGGCA CGGGCTCGGC GGTGGTGAAG
CAGGGCGGCA AGACGTGGTT CTTCCTGACC GCCGATTACG CGTTCGGCAA GGCGCTCGAG
AAGAACACCG CGGACGTCGT CAAGGCCAAC GGCGGCAAGG TGCTCGGCGA AGTGCGCCAT
CCGCTGTCGG CGTCGGATTT CTCGTCGTTC CTGTTGCAGG CGCAGTCGTC GAAGGCGCAG
ATCCTCGGCC TCGCGAACGC GGGGGGCGAC ACGGTGAACG CGATCAAGGC GGCGAAGGAA
TTCGGCATCA CGAAGACGAT GAAGCTCGCC GCGCTGCTGA TGTTCATCAA CGATGTCCAC
GCGCTCGGCC TCGAGACGAC GCAAGGTCTC GTGCTGACGG ACAGCTGGTA CTGGAACCGC
GATCAGGCGT CGCGGCAGTG GGCGCAGCGC TATTTCGCGA AGATGAAGAA GATGCCGTCG
AGCCTGCAGG CGGCGGACTA TTCGTCGGTG ACGACTTACC TGAAGGCGGT GCAGGCGGCG
GGCTCGACCG ATTCCGACAA GGTGATGGCG CAGCTCAAGA AGATGAAGAT CGACGACTTC
TATGCGAAGG GCTCCATCCG CACGGACGGC AGCATGATTC ACGACATGTA TCTGATGGAA
GTGAAGAAGC CGTCCGAATC GAAGGAGCCG TGGGACTACT ACAAGGTCGT CGCGACGATT
CCGGGCGAGC AGGCATTCAC GACGAAGCAG GAGACGCGCT GCGCGCTCTG GAAGTGA
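
The GC content field can be recomputed from the gene sequence above. The sketch below assumes GC content means the fraction of G and C bases over the full gene sequence; this record does not say how its 64% figure was derived or rounded. The helper name `gc_fraction` is illustrative, and the example runs on the first two 10-base blocks only.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for layout."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two blocks of the gene sequence; paste the full listing to
# compare against the reported GC content.
first_blocks = "ATGAAATTGA AGACCCTTGC"
print(gc_fraction(first_blocks))  # 0.4
```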
 
Protein sequence
MKLKTLAHAC LAVAAAWSVG AAQAADSVKI GFITDMSGLY ADIDGQGGLE AIKMAVADFG 
GKVNGKPIEV VYADHQNKAD IAASKAREWM DRGGLDLLVG GTNSATALSM NQVAAEKKKV
YINIGAGADT LTNEQCTPYT VHYAYDTMAL AKGTGSAVVK QGGKTWFFLT ADYAFGKALE
KNTADVVKAN GGKVLGEVRH PLSASDFSSF LLQAQSSKAQ ILGLANAGGD TVNAIKAAKE
FGITKTMKLA ALLMFINDVH ALGLETTQGL VLTDSWYWNR DQASRQWAQR YFAKMKKMPS
SLQAADYSSV TTYLKAVQAA GSTDSDKVMA QLKKMKIDDF YAKGSIRTDG SMIHDMYLME
VKKPSESKEP WDYYKVVATI PGEQAFTTKQ ETRCALWK
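
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a minimal sketch, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in the alternative start codons it permits). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAAATTGA AGACCCTTGC GCACGCCTGT"))  # MKLKTLAHAC
```

Passing the full gene sequence to `translate` allows a residue-by-residue comparison with the protein sequence listed above.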