Gene BBta_3303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_3303 
Symbol 
ID5154022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp3458209 
End bp3459276 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content65% 
IMG OID640558166 
Productputative alkanesulfonate ABC transporter, substrate binding protein 
Protein accessionYP_001239313 
Protein GI148254728 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACC GGCGGAGGCT GTTGCAGGCC GGCGCTGCGG CAGCGATTGC GGCGCCGCTT 
GGCGGGTATG GGGCGCAGGC ATTGTCGCTG CGCGCGACAC CGCAGATCGA TCTCTCGCAG
TTTCCGATCT GCAGGACCGC GACGGATGGA CCAGCCCTCA CGGGGGCGCC GCGCAAGCTG
AAACTGTCCT GGAATGCCGG TGCGGTCTGC CTGACGCCGG TCCCGGTGGC GATCGATCAG
GGCTTCTTCA AAAAGCACAA TCTCGACGTC GAGCTGATCA ACTACTCGGG CTCGACCGAC
CAGTTGCTGG AGGCCATCGC GACCGGCAAG AGCGACGCCG GCCTCGGCAT GGCGCTGCGC
TGGCTGAAGC CGCTGGAGCA GGGTTTTGAT GTCAAGATCG CCGCCGGCAC GCATGGTGGC
TGCATGCGCG TGCTGGTGCG CGCCGACTCC GGGATCACCA AGCTCGCAGA CCTTAAGGGC
AAGGCGGTCG CGGTCGGCGA TCTCGGTGGC CCCGACAAGA ACTTCTTCTC GATCCAGCTG
GCGCGGCTTG GCATCGATCC GGTGAAGGAC GTCGACTGGC GGGTCTATCC GGGGGCCGTC
GTCAACGTCG CGGCCGACAA GGGCGAGACG CAGGCGTTCC TGGCCTCCGA TCCGCTCGCT
TATCTCTGGC TCAAGGACCC CGCCTACAAG GAAGTGGCCT CCAATCTCGA CGGCGAGTAT
CAGAACCGGG TCTGCTGCAT CCTCGGGCTG CGCGGCAGCC TGGTGCGCGA CGAGCCGGCC
GTGGGGCGCG CCATCACCCA GGCGCTGCTC GATGCCGCGA TGTTCACGGC ACAGAATCCG
AGCGAGGCGG CGAAGTCGTT CCAGCCCTAC GCGCCGAAGC AGGCGACGCT GGCCGATCTC
GAAGGCATGG CGCGCTACCA CACCCATCAT CACCATCCGC ATGGTCCGGC GCTGAAGCAG
GAGCTGAAGG CCTATGCCGA CGACCTCAAG GTCGTCTCCG TGTTCAAGCC GAGCACCGAC
ACCAACAAAT TTGCCGAGCG AATCTATGTC GACATTTTCT CTGTCTGA
 
Protein sequence
MLDRRRLLQA GAAAAIAAPL GGYGAQALSL RATPQIDLSQ FPICRTATDG PALTGAPRKL 
KLSWNAGAVC LTPVPVAIDQ GFFKKHNLDV ELINYSGSTD QLLEAIATGK SDAGLGMALR
WLKPLEQGFD VKIAAGTHGG CMRVLVRADS GITKLADLKG KAVAVGDLGG PDKNFFSIQL
ARLGIDPVKD VDWRVYPGAV VNVAADKGET QAFLASDPLA YLWLKDPAYK EVASNLDGEY
QNRVCCILGL RGSLVRDEPA VGRAITQALL DAAMFTAQNP SEAAKSFQPY APKQATLADL
EGMARYHTHH HHPHGPALKQ ELKAYADDLK VVSVFKPSTD TNKFAERIYV DIFSV