Gene Information |
Locus tag | BBta_4607 |
Symbol | |
ID | 5149415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 4828532 |
End bp | 4830058 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640559407 |
Product | putative sugar transferase family protein |
Protein accession | YP_001240541 |
Protein GI | 148255956 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase; [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
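The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above (BBta_4607).
length_bp = gene_length(4828532, 4830058)
print(length_bp)                  # 1527
print(protein_length(length_bp))  # 508
```

Both results match the Gene Length (1527 bp) and Protein Length (508 aa) fields reported in the record.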
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.512572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0960232 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGATG CTGCGGCCTC CGCCGCCACC ATCGCCGTCG CCGGCCAGCC GGCGATTGAA CGCCGGCGAC GCCTGTCGCC AGCCGCGCTG GCCGTCACCA ATCAAAAGGT CCACCGCGCC TATTCCCCGA TCGTGCTCGC CGGCTTCGTC CGGATTGCGG ATTTCGTGCT GCTGAGCTTC GTCGGCAGCG CCGTCTATTT CGGTTACGTC GTTCCGATCA GCGGTTTCCA TTGGGAGTAT CTCGCTGCCA TCGTCGGTAT GGCGATCAGC GCGGTGATCT GCTTCCAGGC CGCCGACATC TATCAAATCC AGGTCTTTCG CGCCCAAGTG CGGCAAATGA CCCGGATGAT CTCCTCCTAC TGCTTCGTCT TCCTGCTGTT CATCGGCCTG TCGTTTTTCG CCAAGCTCGG CAGCGAGGTC TCCCGCCTGT GGCTTGCGGC CTTCTTCTTC ATCGGTCTTG GCGCGCTGAT CACAAGTCGC GTCGTTCTCG CCAACAGGAT CCGCAGTTGG GCCAGGCAAG GTCGTCTCGA CCGTCGAACC ATCATCGTCG GCGCCGATCA GAGCGGCGAA GATCTCGTGC GCGCACTGAA ACTGCAGAAC GACTCCGAGA TCGAAATCCT CGGCGTGTTC GACGACCGCA GCGATTCCAG GTCGCTCGAC ACCTGCGCAG GCGTCCCGAA GCTCGGCAAG GTCGACGACA TCGTCGAATT CGCCCGACGC ACGCGCGTCG ACCTGGTGTT GTTCGCTTTG CCGATCTCGG CGGAGACCCG CATTCTCGAC ATGCTGAAGA AGCTGTGGGT TCTGCCTGTC GACATCCGCC TCTCGGCGCA TACCAACAAG CTGCGTTTCC GTCCTCGCTC TTATTCCTAT CTCGGCGCGG TGCCGACGCT CGACGTCTTC GAGGCCCCGA TCACCGATTG GGATCTGGTG ATGAAGTGGC TGTTCGATCG GCTGGTCGGC GCGCTGATCC TGCTGCTGGC GCTCCCTGTG ATGGCGCTGG TCGCACTGGC GATCAAGCTC GACAGCCCCG GTCCGGTGCT GTTTCGACAG AAACGCTTCG GCTTCAACAA TGAGCGCATC GACGTCTTCA AGTTCCGCTC GCTCTATCAT CACCAGGCCG ACCCCACTGC CTCCAAGGTC GTGACCAAGA ACGATCCGCG CGTCACCCGC GTCGGCCGCT TCATCCGCAA GACCAGTCTC GACGAGCTGC CGCAGCTGTT CAACGTGGTA TTCAAGGGCA ATCTGTCGCT GGTCGGTCCG CGTCCGCACG CCGTGCAGGG CAAGCTGCAG AACCGCTTGT TCGACGAAGC CGTCGACGGC TATTTCGCGC GCCACCGCGT CAAACCCGGG ATCACCGGAT GGGCGCAGAT CAATGGCTGG CGCGGCGAGA TCGACAAGGA AGAAAAGATC CAGAAGCGCG TCGAGTTCGA CCTCTATTAT ATCGAGAACT GGTCCGTTCT GCTCGACCTC TACATCCTGC TCAAGACGCC GCTTGCGCTG ATGACCAAGA GCGAGAACGC CTATTGA
|
Protein sequence | MLDAAASAAT IAVAGQPAIE RRRRLSPAAL AVTNQKVHRA YSPIVLAGFV RIADFVLLSF VGSAVYFGYV VPISGFHWEY LAAIVGMAIS AVICFQAADI YQIQVFRAQV RQMTRMISSY CFVFLLFIGL SFFAKLGSEV SRLWLAAFFF IGLGALITSR VVLANRIRSW ARQGRLDRRT IIVGADQSGE DLVRALKLQN DSEIEILGVF DDRSDSRSLD TCAGVPKLGK VDDIVEFARR TRVDLVLFAL PISAETRILD MLKKLWVLPV DIRLSAHTNK LRFRPRSYSY LGAVPTLDVF EAPITDWDLV MKWLFDRLVG ALILLLALPV MALVALAIKL DSPGPVLFRQ KRFGFNNERI DVFKFRSLYH HQADPTASKV VTKNDPRVTR VGRFIRKTSL DELPQLFNVV FKGNLSLVGP RPHAVQGKLQ NRLFDEAVDG YFARHRVKPG ITGWAQINGW RGEIDKEEKI QKRVEFDLYY IENWSVLLDL YILLKTPLAL MTKSENAY
|
| |
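The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch, assuming plain Python with no bioinformatics library; `clean_sequence` and `gc_fraction` are hypothetical helper names. It strips the block spacing and computes a GC fraction, which for the full gene sequence could be compared against the GC content field of the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Only the first two blocks of the gene sequence, used here as a short example.
example_blocks = "ATGCTGGATG CTGCGGCCTC"
example_seq = clean_sequence(example_blocks)
print(len(example_seq), gc_fraction(example_seq))
```

To check the whole record, paste the complete Gene sequence value into `clean_sequence`; the cleaned length should then equal the Gene Length field.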