Gene BBta_1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_1007 
Symbol 
ID5149720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp1044992 
End bp1046002 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content61% 
IMG OID640556001 
Productglycosyl transferase family protein 
Protein accessionYP_001237169 
Protein GI148252584 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.47724 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGCCG TCTCGAACAC CAATTCAATT GCGAGCGAAG CGACGTTGCC GATTTCCGAC 
ATTTGCGACG TCAGCGTCGT TCTGGTCAAT TACAACACTG AACATCTGTT GGAGCGCGTC
TTTGCCGCGC TGTTTGCCGC GCGACAGTCG CTCACGATGC AGACCATCGT CATCGACAAT
GCCTCGCGAG ACAATTCGGT GACGCTGCTT CGGAGCCGAT ATCCGGACGT CGAGCTGATC
GCCAATGCGA GCAATGTCGG CTTCGGACGC GCCAACAATC AGGCCCTGCC GCGCATCCGC
GGACGCTACG TCCTGCTGCT CAATACGGAT GCCTTCGTTG CCGAGGACAC GCTGACGAAG
ACGTTCGCTT ACATGGACGG ACATCCCCGG TGCGGCGTCC TCGGCGTTCG CCTCGCTGGC
GAGAGCGGGA CGCTGCAGCC CTCCTGCCGT TACTTCCCGA CGCCGCTGAA TGTCTTTGTC
GCAGAGAATG GACTGGGACG GCTCTTCCCG ACCGTGCAGA TGATCGACGA TATGAGTTGG
GATCATGCCG GGACGCGCGC CTGCGACTGG GTCCCAGGCT GCTTCTATCT GATACGAAAA
TCGGCTATCG ACCAGGTCGG CCTCTTCGAT CCGCGGTTTT TCGTCTATTA TGAGGAGGTC
GATCACTGCC GCCGGATCCG GCAAGCGGGT TGGCAGGTCA CCTATTTCGG TGATGCGACG
GTCGTTCATA TCGGCGGCGA GAGCGCAAAG GCGGATGACC GGCTGACCGC CGCAGGACGG
CAGATTGCGC GACTGCAGGT CGAAAGCGAA ATGTTGTATT TCCGCAAGCA CCATGGGCTG
ACAGGATTGC TCGCCTCCCT CGCTCTGACC TGCTGCGGGG CCGGGCTGGA CCTTCTCAAG
GATCTCGTAC GTCCCCGCAA GGATCGTCCA CGACATGCTC AGCTGCAGAA ACTGAAGCTT
GCTTTTTCCC TGCTCGGTCC GACCGGCTGG GCGACGAGAC CGACGCGGTA G
 
Protein sequence
MNAVSNTNSI ASEATLPISD ICDVSVVLVN YNTEHLLERV FAALFAARQS LTMQTIVIDN 
ASRDNSVTLL RSRYPDVELI ANASNVGFGR ANNQALPRIR GRYVLLLNTD AFVAEDTLTK
TFAYMDGHPR CGVLGVRLAG ESGTLQPSCR YFPTPLNVFV AENGLGRLFP TVQMIDDMSW
DHAGTRACDW VPGCFYLIRK SAIDQVGLFD PRFFVYYEEV DHCRRIRQAG WQVTYFGDAT
VVHIGGESAK ADDRLTAAGR QIARLQVESE MLYFRKHHGL TGLLASLALT CCGAGLDLLK
DLVRPRKDRP RHAQLQKLKL AFSLLGPTGW ATRPTR