Gene BBta_4607 details


Gene Information

Locus tag: BBta_4607
Symbol:
ID: 5149415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 4828532
End bp: 4830058
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 62%
IMG OID: 640559407
Product: putative sugar transferase family protein
Protein accession: YP_001240541
Protein GI: 148255956
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
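
The coordinates and lengths listed above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4828532, 4830058)
print(length, protein_length_aa(length))  # 1527 508
```

Under these assumptions, 1527 bp corresponds to 509 codons, one of which is the stop codon, giving the 508 aa reported in the record.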


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.512572
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.0960232
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGATG CTGCGGCCTC CGCCGCCACC ATCGCCGTCG CCGGCCAGCC GGCGATTGAA 
CGCCGGCGAC GCCTGTCGCC AGCCGCGCTG GCCGTCACCA ATCAAAAGGT CCACCGCGCC
TATTCCCCGA TCGTGCTCGC CGGCTTCGTC CGGATTGCGG ATTTCGTGCT GCTGAGCTTC
GTCGGCAGCG CCGTCTATTT CGGTTACGTC GTTCCGATCA GCGGTTTCCA TTGGGAGTAT
CTCGCTGCCA TCGTCGGTAT GGCGATCAGC GCGGTGATCT GCTTCCAGGC CGCCGACATC
TATCAAATCC AGGTCTTTCG CGCCCAAGTG CGGCAAATGA CCCGGATGAT CTCCTCCTAC
TGCTTCGTCT TCCTGCTGTT CATCGGCCTG TCGTTTTTCG CCAAGCTCGG CAGCGAGGTC
TCCCGCCTGT GGCTTGCGGC CTTCTTCTTC ATCGGTCTTG GCGCGCTGAT CACAAGTCGC
GTCGTTCTCG CCAACAGGAT CCGCAGTTGG GCCAGGCAAG GTCGTCTCGA CCGTCGAACC
ATCATCGTCG GCGCCGATCA GAGCGGCGAA GATCTCGTGC GCGCACTGAA ACTGCAGAAC
GACTCCGAGA TCGAAATCCT CGGCGTGTTC GACGACCGCA GCGATTCCAG GTCGCTCGAC
ACCTGCGCAG GCGTCCCGAA GCTCGGCAAG GTCGACGACA TCGTCGAATT CGCCCGACGC
ACGCGCGTCG ACCTGGTGTT GTTCGCTTTG CCGATCTCGG CGGAGACCCG CATTCTCGAC
ATGCTGAAGA AGCTGTGGGT TCTGCCTGTC GACATCCGCC TCTCGGCGCA TACCAACAAG
CTGCGTTTCC GTCCTCGCTC TTATTCCTAT CTCGGCGCGG TGCCGACGCT CGACGTCTTC
GAGGCCCCGA TCACCGATTG GGATCTGGTG ATGAAGTGGC TGTTCGATCG GCTGGTCGGC
GCGCTGATCC TGCTGCTGGC GCTCCCTGTG ATGGCGCTGG TCGCACTGGC GATCAAGCTC
GACAGCCCCG GTCCGGTGCT GTTTCGACAG AAACGCTTCG GCTTCAACAA TGAGCGCATC
GACGTCTTCA AGTTCCGCTC GCTCTATCAT CACCAGGCCG ACCCCACTGC CTCCAAGGTC
GTGACCAAGA ACGATCCGCG CGTCACCCGC GTCGGCCGCT TCATCCGCAA GACCAGTCTC
GACGAGCTGC CGCAGCTGTT CAACGTGGTA TTCAAGGGCA ATCTGTCGCT GGTCGGTCCG
CGTCCGCACG CCGTGCAGGG CAAGCTGCAG AACCGCTTGT TCGACGAAGC CGTCGACGGC
TATTTCGCGC GCCACCGCGT CAAACCCGGG ATCACCGGAT GGGCGCAGAT CAATGGCTGG
CGCGGCGAGA TCGACAAGGA AGAAAAGATC CAGAAGCGCG TCGAGTTCGA CCTCTATTAT
ATCGAGAACT GGTCCGTTCT GCTCGACCTC TACATCCTGC TCAAGACGCC GCTTGCGCTG
ATGACCAAGA GCGAGAACGC CTATTGA
 
Protein sequence
MLDAAASAAT IAVAGQPAIE RRRRLSPAAL AVTNQKVHRA YSPIVLAGFV RIADFVLLSF 
VGSAVYFGYV VPISGFHWEY LAAIVGMAIS AVICFQAADI YQIQVFRAQV RQMTRMISSY
CFVFLLFIGL SFFAKLGSEV SRLWLAAFFF IGLGALITSR VVLANRIRSW ARQGRLDRRT
IIVGADQSGE DLVRALKLQN DSEIEILGVF DDRSDSRSLD TCAGVPKLGK VDDIVEFARR
TRVDLVLFAL PISAETRILD MLKKLWVLPV DIRLSAHTNK LRFRPRSYSY LGAVPTLDVF
EAPITDWDLV MKWLFDRLVG ALILLLALPV MALVALAIKL DSPGPVLFRQ KRFGFNNERI
DVFKFRSLYH HQADPTASKV VTKNDPRVTR VGRFIRKTSL DELPQLFNVV FKGNLSLVGP
RPHAVQGKLQ NRLFDEAVDG YFARHRVKPG ITGWAQINGW RGEIDKEEKI QKRVEFDLYY
IENWSVLLDL YILLKTPLAL MTKSENAY
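
The gene and protein sequences above can be checked against each other with a short sketch like the one below. It strips the spaces that separate the 10-character blocks, computes a GC fraction, and translates codons until the first stop. The codon table is built from the NCBI amino-acid ordering; translation table 11 uses the same amino-acid assignments as the standard code and differs only in its permitted start codons, so this sketch does not model alternative starts. The helper names (`clean`, `gc_fraction`, `translate`) are illustrative, and only the first row of the gene sequence is used here.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order: first, second and third base each cycle through T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping before the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listed in this record.
first_row = "ATGCTGGATG CTGCGGCCTC CGCCGCCACC ATCGCCGTCG CCGGCCAGCC GGCGATTGAA"
print(translate(first_row))  # MLDAAASAAT IAVAGQPAIE without the space: MLDAAASAATIAVAGQPAIE
print(f"{gc_fraction(first_row):.2f}")
```

Passing the full gene sequence to the same functions would allow the reported protein sequence and GC content to be compared in the same way.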