Gene Information |
Locus tag | BBta_5330 |
Symbol | |
ID | 5155986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 5559682 |
End bp | 5560680 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640560092 |
Product | hypothetical protein |
Protein accession | YP_001241216 |
Protein GI | 148256631 |
COG category | [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.12369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.395647 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGAACC TCCTCACCGA TATCGCAGGC GTCCGCGTCG GCCACGCCGA TGATGCCGCG CTTGCCTCCG GCGTGACCGC CATCCTGTTC GATCAGCCCG CGGTGGCCGC GATCGACATT CGCGGTGGCG GGCCGGGCGT GCGCGAGGAT GCCCTGCTCG ATCCCGCCAA CACGGTGGAG CGGATCGACG CCATTGCGCT GTCAGGCGGC TCGGCATTCG GGCTGGAGGC CGCGAGCGGC ATCCAGGCCT GGCTTGCCGA GCAGGGCCGC GGCTTTGCGG TGCGCGATGC GCTGATCCCG ATCGTGCCCG GCGCGATCGT GTTCGACCTG CTCAATGGCG GCAACAAGGC CTGGGGCCGC TATGCGCCGT ATCGCGAGCT CGGCTATCGG GCGGCGGCCG CCGCCGGCCC CGGCTTCGCG CTGGGGAGCG TCGGCGCCGG GCTCGGCGCC ACCACCGCCA ACCTGCAGGG CGGCCTCGGC TCGGCATCGG CCACGACCAC CAAAGGCGTC AAGGTCGCGG CGATCGCCGT GGTCAATGCG GTCGGCAGCG TCACCGTCGG CGATGGTCCG TGGTTCTGGG CCGCGCCCTA TGAGGTGGAT GGCGAGTTCG GTGGCCGCGG CCTGCCGGCG GCGTTCACGC CGGACATGCT GGCGATGCGC ATCAAGGGCG GTCCGGCCGC ATCGAGCGCG GAGAATACGA CGATCAGCCT GGTCGTGACC GATGCCGTCC TGACCAAGGC TCAGGCCAAG CGGCTCGCGA TGATGGCGCA GACCGGCATG GCGCGCGCGA TCTATCCGGT CCATCTGCCG CTCGACGGCG ATATCGTGTT CGCGGCGGCG ACCTGCGCCA GGCCGATCGA GCCGCTGGTC GAGCTGAGCG AACTCGGCAT GGTGGCGGCC AACGTGCTGG CGCGGGCGAT CGCGCGCGGT GTCTATCACG CCAAGCGGCT GCCGTTCGCA GGCGCCCTGC CCTCCTGGCA GGACCGCTTC GGCGGCTGA
|
Protein sequence | MQNLLTDIAG VRVGHADDAA LASGVTAILF DQPAVAAIDI RGGGPGVRED ALLDPANTVE RIDAIALSGG SAFGLEAASG IQAWLAEQGR GFAVRDALIP IVPGAIVFDL LNGGNKAWGR YAPYRELGYR AAAAAGPGFA LGSVGAGLGA TTANLQGGLG SASATTTKGV KVAAIAVVNA VGSVTVGDGP WFWAAPYEVD GEFGGRGLPA AFTPDMLAMR IKGGPAASSA ENTTISLVVT DAVLTKAQAK RLAMMAQTGM ARAIYPVHLP LDGDIVFAAA TCARPIEPLV ELSELGMVAA NVLARAIARG VYHAKRLPFA GALPSWQDRF GG
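
The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration, and it rests on assumptions not stated in the record: that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table in this record.
START_BP = 5559682
END_BP = 5560680


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the spaces between 10-base blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under those assumptions the coordinates give 999 bp and 332 aa, matching the Gene Length and Protein Length fields. Passing the full Gene sequence field to gc_fraction gives a value to compare with the reported GC content.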
|
| |