Gene BURPS668_A0448 details


Gene Information

Locus tag: BURPS668_A0448
Symbol:
ID: 4886337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 409836
End bp: 410924
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 68%
IMG OID: 640130389
Product: putative ABC transporter, periplasmic substrate-binding protein
Protein accession: YP_001061454
Protein GI: 126443162
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
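
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 409836
END_BP = 410924
GENE_LENGTH_BP = 1089
PROTEIN_LENGTH_AA = 362


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # prints: 1089 362
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.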


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAAA ACCGGCCATT GCTGACGGCG CTGCGCCGCG CCGCGCTCGC CTTCGGCATG 
TGCGCGACGC TCGTCGCGAA CGGCGCATCC GCCGAGCCGC TTTACGCGGG CGAAGACGCG
CTCTATGCGA AGGCCGCCGA CGAAGGGCTC GTCGTGTCGT TCGACACGGG CCCCGAATGG
GCGAACTGGA AGGCGCTGTT CGCGGCGTTC CGCAAGCGCT ATCCGAAGGT GGAGCTCACG
TACAACGACA TCGGCTCGGC CGCGACGGTC GTCGCGCTCG ACAAGTCACG CCGCCGTCCG
CAGGCGGACA CCGCGTACTA CTTCGCGGCA TCGGCGCTCG ACGCGGCTGG CAAGGACGTC
GTCGCGCCGT TCAAGCCGGT CAACTTCGAC AAGCTCCCGC CCGTGTTTCG CGAAGCCGAC
GGCCGCTGGT TCACCGTGCA TTCGCTGAAT GTCGCGTTCC TCGTCAACCG CAAGCTCGTG
AAGAACGTGC CGCGCCGCTG GTCCGATCTG TTGAAGCCCG AGTACAGGAA CGCGGTCGTC
TATCTCGACC CGCGTTCGAC CGGCCAGGGG CAGGTGGTCG TGTTCGCGGC GGCGTCTGCG
CTCGGCGGCG GCGTCGACGA TCCGAAGCCC GGCGCGGAAT TCTTCGGAAA GCTAAAGCAT
GCGGGCAACG TGCTGCGCAT CGAGGGCACG ACGCCGTATG CGAAGTTCGT CAAGGGTGAG
ATCCCGATCC TGATCGGCTA CGAGAACGAC GGCCTGAAGG CGAAGTACGC GGACGGCCTG
GGCGATGCGG TCGACGTCGT GATTCCGCAG GACGGCAGCG TGTGCGCGCC GTATGCGATG
AGCCTCGTGA AGAACGGGCC GAATCCTGCT GCCGCGCAGC TATGGTTGAA CTTCGTGATG
AGCGATGCCG GCCAGGCGCT GTTCGCGCAC GGCTACGTGC GGCCCGCAGT GCCGGGCGTC
GCGCTCGCGC CCGACGTCGC GGCGAAGATG CCGAACGCGC CACAGGTGCG TGCGCTCGAC
GTCGCGAAGG CCGCCGCGCG CAAGGCCGAA GTCGACCGGC TATGGTCGCA GGCGGCGCTT
GGCCAGTAA
 
Protein sequence
MSENRPLLTA LRRAALAFGM CATLVANGAS AEPLYAGEDA LYAKAADEGL VVSFDTGPEW 
ANWKALFAAF RKRYPKVELT YNDIGSAATV VALDKSRRRP QADTAYYFAA SALDAAGKDV
VAPFKPVNFD KLPPVFREAD GRWFTVHSLN VAFLVNRKLV KNVPRRWSDL LKPEYRNAVV
YLDPRSTGQG QVVVFAAASA LGGGVDDPKP GAEFFGKLKH AGNVLRIEGT TPYAKFVKGE
IPILIGYEND GLKAKYADGL GDAVDVVIPQ DGSVCAPYAM SLVKNGPNPA AAQLWLNFVM
SDAGQALFAH GYVRPAVPGV ALAPDVAAKM PNAPQVRALD VAKAAARKAE VDRLWSQAAL
GQ
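
The gene and protein sequences are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below is illustrative only and is not part of the database record. It shows one way to strip the blocking, compute a GC fraction, and translate a fragment with the codon assignments of translation table 11 (which match the standard code; only the permitted start codons differ). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the two fragments are the first and last lines of the gene sequence in this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (T, C, A, G at each position) for table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_block = "ATGTCTGAAA ACCGGCCATT GCTGACGGCG CTGCGCCGCG CCGCGCTCGC CTTCGGCATG"
last_block = "GGCCAGTAA"
print(translate(first_block))  # prints: MSENRPLLTALRRAALAFGM
print(translate(last_block))   # prints: GQ
```

The two translated fragments match the first twenty residues and the final two residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.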