Gene Information |
Locus tag | BBta_6705 |
Symbol | |
ID | 5155052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 6983706 |
End bp | 6985256 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640561393 |
Product | choline dehydrogenase |
Protein accession | YP_001242507 |
Protein GI | 148257922 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
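The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes a single terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 6983706
END_BP = 6985256


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1551 516"
```

Under these assumptions the computed values agree with the Gene Length (1551 bp) and Protein Length (516 aa) fields.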
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0425945 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGACC TCATTGTCGT CGGCGGCGGC TCCGCGGGAG CGGCGGTCGC AGCGCGTCTG TCCGAACAGC CGGAACGGCG CGTGCTGCTG CTGGAGGCCG GCGCCGACTG GCGCGCGGAC GAGGTGCCAT GGGAGATCGC GACGCCCAAT CCGATCCCGA TCATCCATGA TCGGGCCTTC CAGGAGAAGT GGCAGTGGCC GCAGCTGATG TCGCGCCGGG TGGCAGGCCA GGAGATGCGG TTCTACTGGC GCGGCAAGGG GCTCGGCGGC AGCTCGATGA TGAACGGCCA GATCGCGATC CGCGGTGTGG CCGAGGCGTT CGACCAATGG GCTGCCCAGG GCTGCACCGG ATGGTCGGCG CGTGAGATCA TGCCGCTGTT CTCCGTGATC GAGGACGACC TCACTTATGG CGATCGCGAC GGGCACGGCC AGGGCGGTCC GCTGCCGGTC TATCGGGCGC CGGCGGACCA GTGGGGGCCG ATCGACGGTG GCTTGCGCGA TGCGGCGCTG GCGAGCGGCT ATCCCTGGTG CGACGACCTC AACGGGCCCG ATGGCGAGGG CGTCGCCTGC TATCCCATCA ACAGCCGCAA CGGCCGCCGC ATCTCGACCA ACGAAGCCTA TCTGGAGCCG GCGCGAGGCC GTTCGAACCT GGAGATCCGC GGCGGCGCGC TGGTCGATCG TGTCCTGATC AGCGATGGCC GCGCGACCGG CGTGCGCGTG CTTCTCGACG GCGAGGGCGC CAGGGAGATC GCGGCGCGCG AGGTCGTGCT ATGCGCCGGG GCCATCCACA GCCCGGCGAT CCTGTTGCGT TCGGGCCTTG GACCCGCGGC CGAGCTGCGG CACATGGGCA TCGCCGTGTT GCGCGATCTG CCGGTGGGAC GGCACTTCTT CGATCATCCC TTGTTCCGCA CCACTATCCA GCTGCGCGAG GCGCTGCGTC CGACCGACCG CGACACGCGC CACACCAATT GCTGCGTGAC CTATTCGTCC GGCTTGGCCG AGGGCGGCAA GCGCGACATG ATCCTGATCG CGTTCAACCA CCGTGGCATC GGCGTGCCCG GCGCGATCGG CGCCGGCCTC TTCAACGCCT TCTCGCGCGG CACGCTGAAG CTCGCCTCGC CTGATCCCTC CATCGATCCG ATCGTGGACG AGAATATGCT CGCCGATCCC CGCGACATCG CGCGGATGCG GGACGCGGTG AAGCGGCTGG CCGTGATCAC GCAGCAGCCG GCGCTACAGG GTATCGCCGA TTGGATCCGC CTCGGCGATA CCGAACTCAC GCTGCCGCAG GCGGCCGCGC TGCCGGATGC CGAGCTCGAC GCGCTGCTGC GCCGGGTGAC CGGCGATATT CAGCATGCCG CCGGCAGCTG TCGCATGAGC GGCTTCGCCG ACGCCGATGG CGTCGTCAAT CCCGACGGGA CTGTCAAGGG GATCGGCGGC CTGCGCGTCG CCGACGCCTC GATCATGCCG GCCGATTGTC GTGCCAACAC GCATTTCACC ACGGTCGTGA TCGGGGAAGC GATCGCGCGG ATGATGCGGA AGAGGACGTA A
|
Protein sequence | MYDLIVVGGG SAGAAVAARL SEQPERRVLL LEAGADWRAD EVPWEIATPN PIPIIHDRAF QEKWQWPQLM SRRVAGQEMR FYWRGKGLGG SSMMNGQIAI RGVAEAFDQW AAQGCTGWSA REIMPLFSVI EDDLTYGDRD GHGQGGPLPV YRAPADQWGP IDGGLRDAAL ASGYPWCDDL NGPDGEGVAC YPINSRNGRR ISTNEAYLEP ARGRSNLEIR GGALVDRVLI SDGRATGVRV LLDGEGAREI AAREVVLCAG AIHSPAILLR SGLGPAAELR HMGIAVLRDL PVGRHFFDHP LFRTTIQLRE ALRPTDRDTR HTNCCVTYSS GLAEGGKRDM ILIAFNHRGI GVPGAIGAGL FNAFSRGTLK LASPDPSIDP IVDENMLADP RDIARMRDAV KRLAVITQQP ALQGIADWIR LGDTELTLPQ AAALPDAELD ALLRRVTGDI QHAAGSCRMS GFADADGVVN PDGTVKGIGG LRVADASIMP ADCRANTHFT TVVIGEAIAR MMRKRT
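The sequence fields above are printed in blocks of 10 characters separated by spaces. As a hedged sketch of how such a field could be cleaned up and its GC fraction computed, the helpers below (`clean_sequence` and `gc_fraction`, both hypothetical names) strip the whitespace and count G and C bases. Only the first two blocks of the gene sequence are used as sample input here.

```python
def clean_sequence(field: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(field.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First two 10-base blocks of the Gene sequence field, used as sample input.
sample = clean_sequence("ATGTATGACC TCATTGTCGT")
print(len(sample), gc_fraction(sample))  # prints "20 0.4"
```

Applied to the complete Gene sequence field, the same functions give a value that can be compared against the reported GC content of 69%.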