Gene BBta_0204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_0204 
Symbol 
ID5150413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp209016 
End bp210635 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content67% 
IMG OID640555229 
Productputative multidrug resistance protein 
Protein accessionYP_001236407 
Protein GI148251822 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAC CGGCGACGCC GCAGGAGACG CCAGGCCTCG CCACTTGGCT CGGCTTCATC 
CTGATGTGCG TCGGCATGTT CATGGCGATC CTGGACATCC AGGTGGTGGC GACGTCACTG
CCGACGATCC AGGAAGCGCT CGGCATCACG CCGGACGCCA TGAGCTGGGT CCAAACCGCG
TATCTGATCG CCGAGATCAT CGCCATTCCG CTGACCGGCC TGCTCACCCG CGTCTTCAGC
CTGCGTTGGC TATTCGTCGG CGCCGTCAGC ATCTTCACGG CGGCCTCGCT CGGCTGCGCG
ATGAGCGGCA GCTTCCACAT GCTGCTGGCG TTCCGCGTGC TGCAGGGCTT CTTCGGCGGC
CTGTTGATCC CCGTGGTGTT CTCGGCGGTG TTCCTGCTGT TTCCGGCGCG GCTGCACGCG
GTCGCAACCA CGATCGGCGG CGTGGTTGCC GTGCTCGCGC CGACCATCGG CCCCGTCGTC
GGCGGCTTCA TCACCAACAC CTGGTCGTGG CCCTGGCTGT TCCTGATCAA CATCGTGCCG
GGCATCATCG CTGCGCTGAT GACGCCGAGC CTGCTGCCGA AGCAGCGGAT GAATCTCATC
GAGCTCGACA AGCTCGACCT GCTGGCGCTG ATGCTGCTCG CGGTGTCGCT GGCGAGCCTC
GAACTCGGCC TGAAGGAGGC CCCCAAGGGC GGCTGGCTCT CGTCGAACTG CATCGCGCTG
CTGGTGCTGA GCGGATCCTG CCTGACTCTG CTGATCCAGC GGCTGCTGGC TTCGCCGCAT
CCGATCCTGC GGCTCGGCAG CTTTCAGCGC CGCTCGTTCA CGCTCGGCTG CGTGTCGAGC
TTCTGTCTCG GCATCGGCCT GTTCGGCTCG GTCTATCTGA TGCCGGTGTT CCTCGCCTTC
GTGCGCCAGC ACGATGCGTT CGAGATCGGC AAGATCATGC TGGTGACTGG GGTGGCGCAA
CTGATCGCAG CGCCGCTGGT GACAGCGCTG GACGGCAAGG TCGATGCCCG GCTGCTGACC
TCGTTCGGCT TCGCGCTGTT CAGCGCCGGC CTTGCCGCTA GCGCGTTCCA GCCGGCCAGT
GCGGACTATC AGGAAATGTT CTGGCCGCAG GTGGTGCGCG GTGTCGGCAT CATGTTCTGT
CTGTTGCCGC CGACGCGGAT CGCGCTCGGC GATCTGCCGC AGGCCGAGGT CGCCGACGCC
AGCGGCCTGT TCAACCTGAT GCGCAATCTC GGCGGCGCGA TCGGCATCGC TCTGATCGAC
ACGATCATCT ACGGCCGCGT CGGGCTGCAT GCCCAGGCAT TCCGCGACCG GCTGATGGCT
GGCGACACCG CGGCCGCCAA GGCGATCGGA TTGGCCCCGG AGCTGTTGCG CAACCGGCCG
CGCGGCGTGT CCGAGGAAGC CGCGATCGCC TATGTCCGGC CGCTGGTGGA GAAGGCGTCG
CTGGCGCTAT GCGTCAACGA GGCCTGGGCG CTGCTTGCCG GCGTTGCGCT GCTCGGCTTC
ATTCTCATTC CTTTTGCCCG CAACAGGACC GAGAGCCCAT CGCCGCGGCG GCTGGCCCTA
GAGGCTGCGC AGCCGGCGCT CCGACGACGG CTCGATCAGC TCGGGCCGCG GTCCGGATAG
 
Protein sequence
MSAPATPQET PGLATWLGFI LMCVGMFMAI LDIQVVATSL PTIQEALGIT PDAMSWVQTA 
YLIAEIIAIP LTGLLTRVFS LRWLFVGAVS IFTAASLGCA MSGSFHMLLA FRVLQGFFGG
LLIPVVFSAV FLLFPARLHA VATTIGGVVA VLAPTIGPVV GGFITNTWSW PWLFLINIVP
GIIAALMTPS LLPKQRMNLI ELDKLDLLAL MLLAVSLASL ELGLKEAPKG GWLSSNCIAL
LVLSGSCLTL LIQRLLASPH PILRLGSFQR RSFTLGCVSS FCLGIGLFGS VYLMPVFLAF
VRQHDAFEIG KIMLVTGVAQ LIAAPLVTAL DGKVDARLLT SFGFALFSAG LAASAFQPAS
ADYQEMFWPQ VVRGVGIMFC LLPPTRIALG DLPQAEVADA SGLFNLMRNL GGAIGIALID
TIIYGRVGLH AQAFRDRLMA GDTAAAKAIG LAPELLRNRP RGVSEEAAIA YVRPLVEKAS
LALCVNEAWA LLAGVALLGF ILIPFARNRT ESPSPRRLAL EAAQPALRRR LDQLGPRSG