Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_5666 |
Symbol | |
ID | 5155834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 5910794 |
End bp | 5912707 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640560404 |
Product | putative polysaccharide biosynthesis protein |
Protein accession | YP_001241526 |
Protein GI | 148256941 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.811762 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGCC TGTCCCAACT AACCTCCCGC AACCTGCTGA TCGCCGTCCA CGACCTGCTC GTCAGCGTCG TCGCGCTGTT TGCCGCGTTC TATATCCGTT TCGAAGGCGC CAGCAGTTTC TGGGAGCGGG TCCCGCTGCT CCTGCAGCTC CTGCCTTATT TCATCGTCTT CAGCTTCGTG GTCTTCTATT TCTCCAACCT GCTGACCACG AAATGGCGCT TCATTTCGCT GCCGGACGCG CTCAACATCG TGCGCGTCGC GACGGTGCTG ACGCTGGCCC TGCTGGTGCT GGACTACATC ATCGTCGCTC CGAATGTCCG CGGCACCTTC TTCTTCGGCA AGCTCACGAT CATCCTGTAC TGGTTCCTGG AGATCTTCGC GCTGAGCACG CTAAGGTTTG CCTATCGGTA TTTCCGCTAC ACGAGGGTGC GCCACCACGC GAGATCCGAG GATGCCGCGC CGGCGCTGCT GATCGGCCGC ACGGCGGACG CCGAGGTGGT CTTGCGTGGC ATCGAAAGCG GCGCGGTCAA GCGGATCTGG CCGATCGGAA TCCTGTCGCC TTCGATGGCC GACCGTGGTC AGTTGATCCG CGGCGTGCCG GTGCTGGGCG GCATCGACGA CATCGAGGAC GTCGCCACGG ACTTTGCCCG CAGGGGCAAG CCGATCGCGC GCGCGCTGAT GATGCCGACC GCGTTCGAGC CGGACTGCCA TCCGGAGACC TTCCTGATGC GCGCCCGACG GCTGCGCTTG ATCATCAGCC GGCTGCCTTC GCTGGAGAGC GGCGATGCTC CGCGACTGGC GCCGGTGGCG GTCGAGGACT TGCTGCTGCG GCCGAGCGAG AAGATCGATT ACGGACGGCT CGAGGCGCTG GTGAGAGGCA AGGCGGTCGT CGTGACCGGC GGCGGCGGCT CGATCGGCGC CGAGATCTGC CAGCGTGTGG TGGCGTTCGG CGCGGCGCGG CTGCTCATCC TGGAGAATTC CGAGCCGGCG CTCTATGCGA TCACCGAGGT ACTCGCGGCC GCCCCCGAGG CCAAGGTCGC CGTCGAGGGA CGGATCGCCG ACATTCGCGA TCGCGACCGG GTGTTCGCGC TGATCGGCGA GTTCAAGCCG GATCTCGTGT TTCACGCGGC CGCCCTGAAA CATGTGCCGA TCCTCGAGCG TGATTGGAGC GAGGGCGTCA AGACCAACAT CTTCGGCTCG ATCAACGTCG CCGATGCCGC GGTGGCGTCG GGCGCGGAGG CGATGGTGAT GATCTCGACG GACAAGGCGA TCGAGCCGGT GTCGATGCTC GGACTCACCA AACGTTTCGC CGAGATGTAT TGTCAGGCGC TCGACCGCCA GCTTGCGGAG CGCCCGATCG GCCGGCCGCC GATGCGGCTG ATCTCCGTGC GCTTCGGCAA TGTGCTGGCG TCGAACGGCT CGGTGGTGCC GAAGTTCAAG GCCCAGATCG AGGCGGGCGG CCCGGTCACG GTGACTCATC CGGACATGGT GCGCTACTTC ATGACCATTC GCGAGGCCTG TGACCTCGTG ATCACCGCGG CCTCGCATGC GCTGGCACCG GAGCGGCCGG GGGTCTCGGT GTACGTGCTG AACATGGGCC AGCCGGTCCG CATCGTCGAA CTGGCGGAGC GCATGATCCG CCTCTCCGGA CTGCAGCCGG GCTACGACAT CGAGATCGTG TTCACCGGGA TTCGTCCAGG CGAGCGCCTG AACGAGATCC TGTTCGCGAG CGAGGAGCCG CCGGTCGAGA TCGGCGTCGC CGGCATCATG GCCGCCAAGC CGAACGAGCC GCCGATGCAG ACTCTGAAGG GATGGATCGC GGCGCTGGAC CAGGCCATTG CGCGGAACGA TCCGGTGACG ATCAAGGCGG TGCTGAAGGA CGCGGTGCCT GAGTTCGGTT CAAGCGCCGC CTGA
|
Protein sequence | MTRLSQLTSR NLLIAVHDLL VSVVALFAAF YIRFEGASSF WERVPLLLQL LPYFIVFSFV VFYFSNLLTT KWRFISLPDA LNIVRVATVL TLALLVLDYI IVAPNVRGTF FFGKLTIILY WFLEIFALST LRFAYRYFRY TRVRHHARSE DAAPALLIGR TADAEVVLRG IESGAVKRIW PIGILSPSMA DRGQLIRGVP VLGGIDDIED VATDFARRGK PIARALMMPT AFEPDCHPET FLMRARRLRL IISRLPSLES GDAPRLAPVA VEDLLLRPSE KIDYGRLEAL VRGKAVVVTG GGGSIGAEIC QRVVAFGAAR LLILENSEPA LYAITEVLAA APEAKVAVEG RIADIRDRDR VFALIGEFKP DLVFHAAALK HVPILERDWS EGVKTNIFGS INVADAAVAS GAEAMVMIST DKAIEPVSML GLTKRFAEMY CQALDRQLAE RPIGRPPMRL ISVRFGNVLA SNGSVVPKFK AQIEAGGPVT VTHPDMVRYF MTIREACDLV ITAASHALAP ERPGVSVYVL NMGQPVRIVE LAERMIRLSG LQPGYDIEIV FTGIRPGERL NEILFASEEP PVEIGVAGIM AAKPNEPPMQ TLKGWIAALD QAIARNDPVT IKAVLKDAVP EFGSSAA
|
| |