Gene BBta_4109 details


Gene Information

Locus tag: BBta_4109
Symbol:
ID: 5154843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 4314226
End bp: 4315596
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 64%
IMG OID: 640558943
Product: putative glycosyltransferase
Protein accession: YP_001240081
Protein GI: 148255496
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR01426] glycosyltransferase, MGT family
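
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The constant and function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4314226
END_BP = 4315596
GENE_LENGTH_BP = 1371
PROTEIN_LENGTH_AA = 456


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```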


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.114761
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGGT TTCTCATCGC GGCTACGGCG CTACCTGGTC ACGTCTTTCC TTTGTTGGCC 
GTATCCCAAC TCCTCGTGAG TCGGGGCCAC GAGGTCGTCG TCAATACCGG CAGTCTGTTC
AGGGAGCGAG TGGAGGCGAC CGGCGCACGA TTCGTCGCGT TTCGCCCGGA GATCGATCAC
GATTATCGGA AACTCGACGA GCACTTTCCC GAACGCCGCA AGATTGCACC TGGTCCCGCG
CTGCTCTCCT TCGGACTGAA GCACATCTTT GCGGATGCAA TACCGCATCA GGCCGCCGGG
ATACGCGACT TGCACCGCGA GTTTCCTTTC GATGCGATCA TCACCGATAC GATGTTCTGC
GGGACGTTTC CGCTTCTGCT GGGCCCGCCG GAAAAAAGAG TCCCGATCAT CGGGCTTGGG
ATCACGGCGC TCGCTCTGTC GAGCGCAGAC ACCGCCTTCT TCGGCACCGC GCTCCCGCCA
TCCATCACGC CGGCCGATCG CGCGCGCAAT GCTGCGATGA ATAGCCACCT GCAGAACGTG
ATGTTCGGCC CGGTGCAGCA ATACTTCAAC GATGTCCTCG TGAGGATCGG GGCGCGCCCT
CTGCCGGCAT TCCTGTTCGA TAGCATGATC ACGCTGCCTG ACCTGTATTT GCAGCTGACG
GCCTGGGAGT TCGAATATCC GCGTGGCACG ATGCCAAGCA GCATCCGCTT CGTCGGCCCG
CTGCTGCCGC CGCCATCGAC CGGCTTCCGC CCGCCGCCTT GGTGGGACGA GATCGACAAT
GCCGGCCCGA TCGTGCTGGT CACTCAAGGC ACGCTCGCAA ACGAGGATCT TGGGCAGCTT
GTCGGGCCAA CGCTGAGGGG CTTGGCGAAC GAGGACCTGA CGGTGATCGC GTGCACCGGC
GGCCCACCAA CCGAATCGAT CCCGGTGGTG GTGCCGCCCA ATGCGAGGGC GGCGACATTC
CTGCCGTTCG ACCGCCTGCT CCCGAAGGTG AGCGTCATGG TGACCAATGG CGGCTATGGC
GGCGTCAATC ACGCGCTCAG TCTCGGTGTG CCGTTGGTCG TCGCCGGGGA TAGTGAGGAA
AAGCCTGAAA TTGCCGCGCG CGTCGCGTGG GCGGGCGCCG GCATCAATCT TGGGACCGGC
CGGCCATCCG CCTCGCAGAT TCGCGACGCG GTCCGCGCCG TGCTCACCAC GCCGCAATAC
CGGCAGCGTG CACAGGCGCT GCGCGCCGCG TTCGCCAGCT ACAACGCTCG TAACGAGATT
GCGGAAAGGG TCGAACAGCT CGCGGCCACC GGATTGCCAG CCAGCTCTCA GGGAGATCCA
CCTCGCAGGT TCGACCTTGT CGGGACTCCT GCGGAGCCTG CAAGTCGCTG A
 
Protein sequence
MARFLIAATA LPGHVFPLLA VSQLLVSRGH EVVVNTGSLF RERVEATGAR FVAFRPEIDH 
DYRKLDEHFP ERRKIAPGPA LLSFGLKHIF ADAIPHQAAG IRDLHREFPF DAIITDTMFC
GTFPLLLGPP EKRVPIIGLG ITALALSSAD TAFFGTALPP SITPADRARN AAMNSHLQNV
MFGPVQQYFN DVLVRIGARP LPAFLFDSMI TLPDLYLQLT AWEFEYPRGT MPSSIRFVGP
LLPPPSTGFR PPPWWDEIDN AGPIVLVTQG TLANEDLGQL VGPTLRGLAN EDLTVIACTG
GPPTESIPVV VPPNARAATF LPFDRLLPKV SVMVTNGGYG GVNHALSLGV PLVVAGDSEE
KPEIAARVAW AGAGINLGTG RPSASQIRDA VRAVLTTPQY RQRAQALRAA FASYNARNEI
AERVEQLAAT GLPASSQGDP PRRFDLVGTP AEPASR
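
The sequence listings above are grouped in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip the grouping, compute GC content, and translate a coding sequence. It is a minimal illustration and not the pipeline that produced this record: the helper names are hypothetical, only the first and last lines of the gene sequence are included as sample input, and the codon table is the standard one, on the understanding that NCBI translation table 11 uses the same codon-to-amino-acid assignments and differs mainly in the permitted start codons.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First and last lines of the gene sequence, copied from the record.
FIRST_LINE = "ATGGCGCGGT TTCTCATCGC GGCTACGGCG CTACCTGGTC ACGTCTTTCC TTTGTTGGCC"
LAST_LINE = "CCTCGCAGGT TCGACCTTGT CGGGACTCCT GCGGAGCCTG CAAGTCGCTG A"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate(FIRST_LINE))  # start of the protein
print(translate(LAST_LINE))   # end of the protein, stop codon excluded
```

Passing the full 1371 bp gene sequence to `gc_content` and `translate` would allow a comparison against the GC content and protein sequence reported in this record.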