Gene BBta_3300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_3300 
SymbolssuD 
ID5154019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp3454774 
End bp3455874 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content65% 
IMG OID640558163 
Productputative alkanesulfonate monooxygenase 
Protein accessionYP_001239310 
Protein GI148254725 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.501296 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.346185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCATC CGCTTCGTTT CGGCATCTGG GCGTCGGTGC ATGGCCCGCG CGCGGCCCAT 
CAGGATCCGG AGGAGCCCTA TGATGCGTCG TGGGAGCGCA ATCGCGCTCT CGTGTTGAAG
GCGGAGGAGC TCGGCTACGA TGCCACGCTG GTGGCCCAGC ACACGATCAA TCCGCATCAG
GAGGATCTCG ATCAGCTGGA GGCCTGGAGC GCGGCCGCGG CGCTCGCCGC GCTCACCAGC
CGCATCGAAA TCATCACTGC CATCAAGCCA TACCTCTTCC ATCCCGTGGT GCTCGCCAAG
ATGGCGCTCG GCATCGAAAA CATCAGCAGA GGTCGCTTCG CGATCAACCT CGTCAATGCC
TGGAACAGGC CGGAGCTGGC GCGCGCCGGG ATCGGCTTTG CCGAGCACGA TGAGCGCTAT
GCCTATGGCC GGGAATGGAT CACCGTGGTG TCGCGGCTGC TGCAGGGCGA GCGCCTGACC
TACAAAGGCA AATATTTCGA TGTGCAGGAC TACGCGCTGC GGCCGAAGGA TCTCTATCGT
GCGCGGCCGC GCATCTATGT CGGCGGCGAA TCGGAGCCGG CGCGGGCGCT GGTGGCCGAT
CATGGCGATG TCTGGTTCAT CAACGGCCAG CCGCTCGCCG ATGTCGCCGC GCTGATCAAC
AATGTCGCCT CCCGGCCGAG AGGCGCGGCC GTGCCGCTTC GCTTCGGCCT GTCGGCGTTC
GTGATCGCGC GGTCGACCGC GGCGGAGGCT GATGCGGCGC ATGAACGACT GCTGCGGCTT
GCGGAGAAGG ACGCGCCGAT GAAGGCGATC CAGAAGCAGA ACACCGACCC CAAGGTCGTC
ATGATGCAGA CCATGCAGAA GAGCGCGCGG GTGGGCAGCA ATGGCGGTAC GGCCGCCGGG
CTTGTCGGCA GCTATGACGA GGTGGCCGAT CGCATCATCG ACTTTCATGC CGCTGGCATC
GAACTGTTCA TGCTGCAATT TCAGCCCTTC GAGGCCGAGA TGACGCGGTT CGCCGAAGAG
ATCATCCCGC GGATACGGCA GCGCCAGGCC GAGCGCGGCG TTGAGAGGTC CCACCGTCAG
GCGGCCGGCC GCGTCGGCTG A
 
Protein sequence
MVHPLRFGIW ASVHGPRAAH QDPEEPYDAS WERNRALVLK AEELGYDATL VAQHTINPHQ 
EDLDQLEAWS AAAALAALTS RIEIITAIKP YLFHPVVLAK MALGIENISR GRFAINLVNA
WNRPELARAG IGFAEHDERY AYGREWITVV SRLLQGERLT YKGKYFDVQD YALRPKDLYR
ARPRIYVGGE SEPARALVAD HGDVWFINGQ PLADVAALIN NVASRPRGAA VPLRFGLSAF
VIARSTAAEA DAAHERLLRL AEKDAPMKAI QKQNTDPKVV MMQTMQKSAR VGSNGGTAAG
LVGSYDEVAD RIIDFHAAGI ELFMLQFQPF EAEMTRFAEE IIPRIRQRQA ERGVERSHRQ
AAGRVG