Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_3743 |
Symbol | |
ID | 5154142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 3919904 |
End bp | 3920902 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640558581 |
Product | putative glycerophosphoryl diester phosphodiesterase |
Protein accession | YP_001239727 |
Protein GI | 148255142 |
COG category | [C] Energy production and conversion |
COG ID | [COG0584] Glycerophosphoryl diester phosphodiesterase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.166139 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0832321 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGAT TTGGAAAGTT GAGCGGGCTG GTGCTGCTGA TCTGTGCCGG CTTCATGTTC GTCAACAATA CCAATCTCTG GGTGGCGAAG AGAGCAGGGC AGCCGCTGCT GCTGGCGCAT CGCGGGCTGT CGCAGCAATT CGATCGCACC GGACTGACTG GCGACACCTG CACCGCCGCG CGCATGCTGC CGCCGACGCA CGACTATCTC GAGAACACCA TCGCCTCGAT GCGCGCGAGC TTCGCGGCGG GCGCCGATAT CGTCGAATTC GACGTGCATC CGACCACCGA CGGGCGCTTT GCGGTGTTCC ATGACTGGAC GCTCGACTGC CGCACCGACG GTAAGGGCGT CACGCGGGAG CATGCGATGA GGGAGCTGCG ACGCCTCGAC ATCGGCTATG GCTACACCGC CGATGGCGGC CGGACCTTTC CGTTCCGTGG CAAAGGCATC GGCCTGATGC CGTCGCTGGA TGATGTGCTC GCGACGTTTC CGGATCGGTC GTTCCTGATC AACATCAAGA GCAACGATCC GGCGGAAGGT CGGCAGCTTG CAGCGGCTCT GGCGCCAATC GATGCGGCGC AACGCGGCCG TCTGATGGTC TATGGCGGTG ACGCGCCGAT CGCGGCCCTC AGGGCGGCGC AACCCGAGCT GAGGCTCATG TCGCGGAGCA TGCTGAAGAG CTGCTTGCTG CGCTATATCG GTTATGGCTG GACCGGGCTG GTGCCCTCCG CCTGCCGCAC CATGATGGTG CTGGTCCCGA TCAATGTCGC GCCGTGGCTC TGGGGGTGGC CGGATCGGTT CCTGGCACGA ATGGCGGACG CCGGCAGCCG TGTCTTCGTC ATCGGCGCGT ATCGCGGCGA GGCATTTTCC GCGGGCCTCG ATACGCCGGC CGAGATCGCC CGGTTGCCCA CGGCATTTGG CGGCGGCGTG ATGACCGATG AGATCGAAAC CGTCGCGCCG CTGCTGAAGC GCGCTCCGCC GGGCGGCGCA TCCAACTGA
|
Protein sequence | MSRFGKLSGL VLLICAGFMF VNNTNLWVAK RAGQPLLLAH RGLSQQFDRT GLTGDTCTAA RMLPPTHDYL ENTIASMRAS FAAGADIVEF DVHPTTDGRF AVFHDWTLDC RTDGKGVTRE HAMRELRRLD IGYGYTADGG RTFPFRGKGI GLMPSLDDVL ATFPDRSFLI NIKSNDPAEG RQLAAALAPI DAAQRGRLMV YGGDAPIAAL RAAQPELRLM SRSMLKSCLL RYIGYGWTGL VPSACRTMMV LVPINVAPWL WGWPDRFLAR MADAGSRVFV IGAYRGEAFS AGLDTPAEIA RLPTAFGGGV MTDEIETVAP LLKRAPPGGA SN
|
| |