Gene BBta_3533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_3533 
Symbol 
ID5151480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp3685784 
End bp3688012 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content65% 
IMG OID640558385 
ProductSPINDLY family O-linked N-acetylglucosamine transferase 
Protein accessionYP_001239532 
Protein GI148254947 
COG category[N] Cell motility
[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF
[COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAAAGCA GCGTCGGGGC GCGGGCCTTT CAGAACGCAC GGCTCCAAAA ACGGGCCAAG 
AAGCAGGCGG ACGCGCTGTT GCCGGCCGCC ATCAAGGCCT ATCGGGAAGG CCGACAGGCG
GAAGCTCAGG CGCTGTGCCA GCAGATCCTG CAGGACCTGC CCTCTCATTT CGGCGCCCTG
CATCTGCTCG GCGTGTCCGA ACGCGACAGC GGCCGTTTCG ACGAGGCGGT GCTCGTCCTG
ACGCGCGCAA TCGAGAGCGA TCCGCGGTCG GCCGAAGCCC AGTCGGATCT CGGCCTGGCG
CTGTTCCGGC TCGGCCGCTA CGAGGAGGCC CGCGCCCGAT ACGAGCGCGC CATCGCCCTC
AGGCCGAACT TCCCGGCAGC GCTCACCCAT CTCGGCAATA CGCTGATGAA CCTGTTCCGT
TTCGAGGAGG CGATATCGGC GCATGACCGC GCCATTGCGC TAAAGCCCGA TTATGGCGAG
GCCCATGCCA ATCGCGGCAT GGCGCTGATG TTCACGAGCC GCAACGGTGA GGCCGCCGAG
AGCTTCGATC GCGCGCTCTC GCTGCAGCCG CGGTTGCTGA CGGCCCTGTT CGGCAAGGGC
GTGGCCAGCA TGTACCTGCG CGATTTCGAC GCTGCGCTGG CAGCGCTGAA TACGGCGCTG
GCGATCAATC CGAACGCGGC CGCCGTGATC GCCCAGCGCG GGCGGGTGTA TCAAGAGCTG
GGCAGGTTCA CCGAGGCGGA AGCCGAGTTC GACGCCGCAC TCGCCCTCGA ACCGCTGCTG
GACGCCGCAC TTTGCGGCAA GGCCACGGTG ACCCTCGCCA ATGGCAATCT CGCTTTGGCG
ATCTCCGTCA TCAACAAGGT GCTGGCACAG AACCCGAATT CGGAAATCGC CTGGACTCTG
CTCGGCGTCT GCTCTGCGAT GCAGGGCGAC ATCGCCACGG CCATCGAACA TTACGACCGC
GCGCTCGCGA TCCGACCGAA CCATGAGGAC GCGATCACCA AGAAGATCTT TGCGCTGGAT
TTCATGCCCG ATATCGGGGT GGAACATCTC CAGGAAGTCC GGCGATACTG GTGGGAGGCG
ATCGGCTCGC GATTGGAACG CCGCTCGCTC GGCAAGCGAG ATCTCGACCC GGACCGGCGG
CTTGTCGTGG GGTACGTCTC GTCCGATTTC CGCGACCATT CGGCGGCGCT CGCCTTCCTG
CCGATCCTGC AGCACCACGA CCGCACGAAG TTCGAAATCC TGGCGTATTC CTGTTCGCCG
ATGAAGGACG CCAAGACCGA GCTATGCCGC TCGCTGGTCG ATCGCTGGGT CGATGCCTCG
CTGTGGAGCG ACGACAGGCT CGCCGATCAG ATCCAGGCCG ACAAGGTCGA CATCCTCGTC
GACCTCTCGG GACATTCCGC CGGCCACCGG CTCACGATGT TTGCGCGCAA GCCCGCCCCC
ATCCAGGTCT CCGCCGTCGG CAGCGTCACC GGGACAGGCC TCCCCGTCAT GGACTATCTG
CTGGCCGATC CGGTGGTGAT CCCGGCCACG GTCCGGCATC TGTTTGCCGA AAGGATCTAC
GACCTGCCGT CGCTCATCAC GATCGAACCG CCGCCGCCAA TTCCGCCGTC GCCGCTGCCC
ATGCTTCGGA ACGGCCACGT CACCTTCGGC GCGTTCAATC GCATCGACAA GCTGTCTGAG
CCTACGCTCA GACTCTGGTC GAAGCTGATG GCTGCAACGC CCGGCTCGAT GATCGTCGTC
AAGAACCATT CGATGGGCGA TGCCCTGCTG CGCGACGGAT TAATTGCTCG CTTCGTCGCT
CACGGCATTG CCGCGGACCG GGTCATTTGT GCAGGAAAGA CGACGCGCCT GGAGCACCTT
GCGATGTTCG CCGAGATCGA CATCTCGCTC GATCCGTTCC CGCAAAATGG CGGCATCTCG
ACCTGGGAGT CGCTGCAGAT GGGCGTGCCC GTCGTCGCCA AGCTCGGCAG CGGCCCTGCC
GCACGCGCCG GCGGCGCGAT CCTCACGGCG ATCGGCCTCG ACGAATGGGT CGCCGACGAT
GATGAAGGCT ATCTCGCAAC CGCGCTGAAC TTCTGCTCCC GTCCTGACGA GCTTGCGGCG
CTCCGCGCCG CCCTGCCCGC AATCGTTTTG AATTCGGCCG CCGGCAACAG CGCGCTCTAC
ACCGCGCATG TCGAGAAGGC CTATCGGACG TTCTGGCACG ACCATTGTGC AAGGGCGCAG
AGGCGCTAG
 
Protein sequence
MQSSVGARAF QNARLQKRAK KQADALLPAA IKAYREGRQA EAQALCQQIL QDLPSHFGAL 
HLLGVSERDS GRFDEAVLVL TRAIESDPRS AEAQSDLGLA LFRLGRYEEA RARYERAIAL
RPNFPAALTH LGNTLMNLFR FEEAISAHDR AIALKPDYGE AHANRGMALM FTSRNGEAAE
SFDRALSLQP RLLTALFGKG VASMYLRDFD AALAALNTAL AINPNAAAVI AQRGRVYQEL
GRFTEAEAEF DAALALEPLL DAALCGKATV TLANGNLALA ISVINKVLAQ NPNSEIAWTL
LGVCSAMQGD IATAIEHYDR ALAIRPNHED AITKKIFALD FMPDIGVEHL QEVRRYWWEA
IGSRLERRSL GKRDLDPDRR LVVGYVSSDF RDHSAALAFL PILQHHDRTK FEILAYSCSP
MKDAKTELCR SLVDRWVDAS LWSDDRLADQ IQADKVDILV DLSGHSAGHR LTMFARKPAP
IQVSAVGSVT GTGLPVMDYL LADPVVIPAT VRHLFAERIY DLPSLITIEP PPPIPPSPLP
MLRNGHVTFG AFNRIDKLSE PTLRLWSKLM AATPGSMIVV KNHSMGDALL RDGLIARFVA
HGIAADRVIC AGKTTRLEHL AMFAEIDISL DPFPQNGGIS TWESLQMGVP VVAKLGSGPA
ARAGGAILTA IGLDEWVADD DEGYLATALN FCSRPDELAA LRAALPAIVL NSAAGNSALY
TAHVEKAYRT FWHDHCARAQ RR