Gene BURPS668_3986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3986 
Symbol 
ID4885492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3889930 
End bp3891351 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content67% 
IMG OID640129914 
Productputative amino acid uptake ABC transporter, periplasmic amino acid-binding protein 
Protein accessionYP_001060979 
Protein GI126440373 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGG CGGCGGCGAT ACGGGCGGCC GGCCGCGTTT TTTGTAACCG CGGCGCGGGC 
GGCATCGGCC GGAAGATCGC GGTAGAGGCA GCAGAAGGGC GCGTGCGATG CGCACGCGCG
AGGGGAGCGC AACAGCAAGA AGAACGCGCC CCGCGCCGGT TTCACGTCGA ACGTGAAACC
GGCGCGGCGG CGTGCAGGTT CGATCGGTGC GCCATGTCGC CATGTCCGCT CGATCGCGTT
TCGCGCAGCG TTTGGAGACA CACAATGAAA ATGAATCGAT GGATCGAGGT TGCGTTCGCG
GCCGGCTGCC TATGTACTGC CGGGCTTGCG TCGGCGCAGG TGAAGATCGG CGTCACGGTT
TCGGCAACGG GGGCCGCCGC GTCGCTCGGC ATTCCCGAGA AGAATACGGT TGCGTTGCTG
CCGAAGGAGA TCGGCGGCAA GCGCGTCGAG TACATCGTGC TCGACGATGC GTCGGACACG
AGCCGCGCAG TGCAGAACAC GCGCAAGCTG ATCGACGAGG ATCATGTCGA CGCGATCGTC
GGCTCGTCGA TCACGCCGAA TGCGCTCGCG ATGATCGACG TCGTCGCGCA GGGCAAGACG
CCGGCGATCT CGCTTGCCGC AAGCGCGCAC ATCATTGCGC CGATGGATGC GAAGCGCGCA
TGGGTGTTCA AGACGCCGCA AAACGATCGC CTGATGGCTG ACGCGCTCGC CGGCTACATG
GCGAAGCACG GCGTGAAGAC GGTCGGCTTC ATCGGCTTCG CGGATGCGTA TGGCGACGGC
TGGTATGGCG TGTTCAGCGC GGCCGCCGCT GCGAACGGGC TCAGGATTGT CGCGAACGAG
CGCTACAACC GCACCGATGC ATCGGTGACG GGGCAAGTGC TGAAGACGTT GGGCGCGCGC
CCCGACGCGG TGCTGATCGC CGGCGCGGGC ACGCCTGCGG CGCTGCCCGC GAAGACGCTG
AAGGCGCGCG GTTACACGGG CAAGGTCTAT CAGACGCACG GCGTCGCGAA CAACGATTTC
CTGCGCGTCT GCGGCAAGGA TTGCGACGGC GAATTGCTGC CGGCGGGGCC GGTGCTCGTT
GCCGATCAAC TGCCCGATTC GAATCCGGCC AAGCAGCCGG CGCTCGCGTA CAAGGCCGCG
TATGAGAAGG CGTACGGCGC GGCAGCGGTG TCGACGTTCG GCGGCCATGT GTGGGACGCG
GGGCTGATGC TCCAGCGCGC GATTCCGGAC GCGCTGAAGA AGGCGCAGCC CGGCACGCCG
GCGTTCCGCG AGGCGCTGCG CGGTGCGCTC GAGAACGTGA AGGACTTGCC GGTGTCGCAC
GGCGTGATCA ACACGACGCC CGCCGACCAC AACGGCTTCG ATACGCGCGC GCGCGTGATC
GTGCAGATCG TCGGCGACAA GTGGAAACTG CAGGCTGATT GA
 
Protein sequence
MATAAAIRAA GRVFCNRGAG GIGRKIAVEA AEGRVRCARA RGAQQQEERA PRRFHVERET 
GAAACRFDRC AMSPCPLDRV SRSVWRHTMK MNRWIEVAFA AGCLCTAGLA SAQVKIGVTV
SATGAAASLG IPEKNTVALL PKEIGGKRVE YIVLDDASDT SRAVQNTRKL IDEDHVDAIV
GSSITPNALA MIDVVAQGKT PAISLAASAH IIAPMDAKRA WVFKTPQNDR LMADALAGYM
AKHGVKTVGF IGFADAYGDG WYGVFSAAAA ANGLRIVANE RYNRTDASVT GQVLKTLGAR
PDAVLIAGAG TPAALPAKTL KARGYTGKVY QTHGVANNDF LRVCGKDCDG ELLPAGPVLV
ADQLPDSNPA KQPALAYKAA YEKAYGAAAV STFGGHVWDA GLMLQRAIPD ALKKAQPGTP
AFREALRGAL ENVKDLPVSH GVINTTPADH NGFDTRARVI VQIVGDKWKL QAD