Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_2203 |
Symbol | |
ID | 5150896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 2275883 |
End bp | 2277088 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640557133 |
Product | putative regulatory protein (nitrile hydratase activator like) |
Protein accession | YP_001238289 |
Protein GI | 148253704 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.514388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAAAC TCCCCGTCAC AGTCTTGTCC GGCTTCCTTG GGGCTGGGAA GACTACTTTG CTCAACCACA TCCTGAACAA CCGCCAGGGT CTGAAAGTGG CGGTGATCGT CAACGACATG AGCGAGGTGA ACATCGATGC GGACCTCGTT CGCGATGGCG GTGCCAATCT TTCGCGGACG GATGAGCAGT TGGTCGAAAT GACCAATGGC TGTATCTGCT GCACCCTGCG CGACGATCTG CTGAAGGAGG TTCGCGCGCT CGCCGAAAGC GGCCGGTTCG ATTACCTCGT GATCGAATCG ACCGGGATCT CCGAGCCGCT CCCGGTCGCC GCGACGTTCG ACTTCCGCGA CGAGCACGGC GCGAGCCTGT CGGACGTCGC ACGCCTCGAC ACGATGGTGA CGGTCGTGGA CGCGGTGAAT CTGCTGAAGG ATTATTCCTC GTCGGATTTC CTCGCGCAGC GCGGCGAGGC ACTGGATGGC GACCAGCGCG CGCTGGTCGA TCTCCTGGTC GAGCAGATCG AGTTCGCCGA CGTGGTCGTG CTGAACAAGG TCGACGACGC CAGCGAGGAT CAACGCGAGG CGGCGCGCAA GATCATTCGC TCGCTCAATC CCGATGCCGA TTTGATCGAG GCAAGCCATA GCTGCGTGCC GCTGGAGCGC GTGCTCGCCA CCGGCCGTTT CGACTTCGCG CGCGCTCAGC AGCATCCGCT CTGGTATAAT GAGCTCTACG GCTTTGCCGA GCACACGCCG GAGACGGAGA CCTATGGCGT GACCAGCTTC GTCTATCGCG CCCGGCGTCC GTTCGTTCCG GTTCGGTTCG ATCAGTTCCT GCGCGAGCAG TGGCCTGGGG TCATCCGCGC CAAGGGGCAC TTCTGGCTGG CCACGCGGCC GCAATGGCTC GGCGAGATCA GCCAGGCCGG AGCCATCGTG CGCACCAGCG CGCTCGGCTT CTGGTGGGCG GCGGTGCCGC AGAAATTATG GCCGGAGGAC CCCGCTTGGC GCGCACGCAT GCTCGAGCGC TGGGACGACA TCTACGGCGA CCGCCGTCAG GAGATTGTCT TCATCGGCAG CAACATGGAC GAGGCTGGGC TGCGTGCGCG CCTGGACGCC TGCCTGCTGG CCGGCAAGCC GGCGATGGAT GTCGCCGCCT GGGCCAGATT GTCCGATCCG TTTCCCGTCT GGCGACGGGC CAATGAGGCA GCCTGA
|
Protein sequence | MPKLPVTVLS GFLGAGKTTL LNHILNNRQG LKVAVIVNDM SEVNIDADLV RDGGANLSRT DEQLVEMTNG CICCTLRDDL LKEVRALAES GRFDYLVIES TGISEPLPVA ATFDFRDEHG ASLSDVARLD TMVTVVDAVN LLKDYSSSDF LAQRGEALDG DQRALVDLLV EQIEFADVVV LNKVDDASED QREAARKIIR SLNPDADLIE ASHSCVPLER VLATGRFDFA RAQQHPLWYN ELYGFAEHTP ETETYGVTSF VYRARRPFVP VRFDQFLREQ WPGVIRAKGH FWLATRPQWL GEISQAGAIV RTSALGFWWA AVPQKLWPED PAWRARMLER WDDIYGDRRQ EIVFIGSNMD EAGLRARLDA CLLAGKPAMD VAAWARLSDP FPVWRRANEA A
|
| |