Gene BBta_4299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4299 
Symbol 
ID5149396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4503452 
End bp4504552 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content63% 
IMG OID640559118 
Productputative alkanesulfonate monooxygenase (FMNH2-dependent aliphatic sulfonate monooxygenase) 
Protein accessionYP_001240255 
Protein GI148255670 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0448772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00193589 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGGCCA AGGATTCGTC CATCAAGTTC GCCTATTGGG TGCCCAACGT TTCGGGCGGA 
TTGGTGATCT CCAACATCGA GCAGCGGACG AGCTGGGATA TCGGCTACAA CAAGCGGCTG
GCACAGATCG CCGAGGAGAG CGGGTTCGAC TACGCGCTGA GCCAGATCCG TTTCACGGCT
GGCTATGGCG CCGAATATCA ACATGAATCG GTGTCGTTCA GCCATGCGCT GCTCGAAGCG
ACGACGAAGC TGAAGGTGAT CGCCGCGATC CTGCCCGGAC CGTGGCATCC GGCGGTCGTC
GCCAAGCAGA TCGCCACCAT CAATCATCTC ACCCAGGGGC GCGTTGCGGT GAACATCGTC
AGCGGCTGGT TCCGCGGCGA GTTCGCCGCG ATTGGCGAGC CCTGGCTGGA TCACGATGAG
CGCTATCGCC GCTCGGAGGA GTTCATCCGT GCGCTCCGGG GCATCTGGAC CGAGGACAAC
TTCAATCTCC GCGGCGACTT CTATCGTTTC ACGAACTACT CGCTGAAGCC GAAGCCGATC
GACCCGCAGC CGGAGATCTT CCAGGGCGGC AGCTCGCGTG CCGCCCGCGA CATGGCCTCG
CGCGTCTCTG ACTGGTACTT CACCAACGGC AACTCGGTGG AGGGCATCAA GGCCCAGGTC
GACGATATCA GGGCCAAGGC GAAGGCCAAC AATCACCAGG TCAAGATCGG CGTCAACGCG
TTCGCGATCG CGCGCGACAC CGAGGCCGAG GCGCAGGCCG TGCTCAAGGA AATCATCGAC
AACGCCAATC CCGAGGCCGT CCACGCCTTC GCGCACGAGG TGGCCAATGC CGGCAAGGCG
TCGCCGGAGC GCGAGGGCAA TTGGGCGAAG TCGACCTTCG AGGATCTCGT CCAATACAAT
GACGGCTTCA AGACCAACCT GATCGGCACG CCGCGGCAGA TTGCCGAGCG CATCGTCGCA
CTGAAGGAGG TCGGCGTTGA CCTCGTGCTG CTTGGCTTCC TGCACTTCCA GGAGGAGGTC
GCCTATTTCG GCAAGCACGT GATCCCGCTG GTGCGCGAGC TCGAGGCCAA GGCCGGCCTT
CGCGTGCAGG CTGCCGAGTA G
 
Protein sequence
MTAKDSSIKF AYWVPNVSGG LVISNIEQRT SWDIGYNKRL AQIAEESGFD YALSQIRFTA 
GYGAEYQHES VSFSHALLEA TTKLKVIAAI LPGPWHPAVV AKQIATINHL TQGRVAVNIV
SGWFRGEFAA IGEPWLDHDE RYRRSEEFIR ALRGIWTEDN FNLRGDFYRF TNYSLKPKPI
DPQPEIFQGG SSRAARDMAS RVSDWYFTNG NSVEGIKAQV DDIRAKAKAN NHQVKIGVNA
FAIARDTEAE AQAVLKEIID NANPEAVHAF AHEVANAGKA SPEREGNWAK STFEDLVQYN
DGFKTNLIGT PRQIAERIVA LKEVGVDLVL LGFLHFQEEV AYFGKHVIPL VRELEAKAGL
RVQAAE