Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_6113 |
Symbol | |
ID | 5152611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 6339546 |
End bp | 6341417 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640560827 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001241946 |
Protein GI | 148257361 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG1082] Sugar phosphate isomerases/epimerases [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.471766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACGC GCTCGATCGC GACGGTCTCC ATCTCAGGCG CGCTCGATGA GAAGCTCAAG GCGATTGCCG CGGCAGGCTT TGAAGCGGTC GAGATCTTCG AGAATGACCT GATTGCGTTC GGGTTGCGCC CGCGCGAGAT CCGGCAGATG TGCCGCGACC TGGGGCTGTC GATCTGCGCC TATCAGCCGT TCCGCGATTT CGAGGGCATG CCGGAGCCGC AGCGCGCGCG CGACTTCAAC CGCGCCGAAC GCAAATTCGA CCTGATGCAG GAGCTCGGCA CGGATCTGCT GCTGATCTGC AGCAGCGTGT CGCCGGCCTC GCTCGGCGGC ATCGACCGTG CTGCCGCCGA TTTCCACGAG CTCGGCGAGC GCGCCGCCAA GCGCGGCCTG CGCGTCGGCT ACGAGGCGCT GGCCTGGGGC CGCCACGTCC ATGACTATCG TGACGCCTGG GAGATCGTGC GCCGCGCCGA TCACAAGGCG ATCGGCGTCA TTCTCGACAG CTTCCACGCG CTGGCACCGG CGCTGCCGAC CGACGCGATC CGCGCCATCC CCGCCGACAA GATCTTTCTC GTGCAACTCG CTGACGCGCC GAAGCTCGAG CTCGACGTGC TATCCTGGAG CCGGCATTTC CGCAGCTTCC CCGGGCAGGG CGACCTCCCG GTTAGCGCGT TCATGGAGGC GGTGGCGGCG ACCGGCTATT CCGGGCCGCT GTCGCTCGAG ATCTTCAACG ACCAGTTCCG CGCAGGTTCT GCGCCGCGTA CCGCGCTCGA CGGCATGCGT TCGCTGCTGC TGCTGCAGGA TGATCTGGCC GGGACGATGC CGGCTGCGTC CGAGCCGCGC TTCGAGCCCC GCGTCAAGAG CCACGGCATC GGCTTCGTCG AGTTCGCCGT CAGCGAGGAC AAGGCGGCGT CGCTGGCGAC GTTGTTCCGC CAGCTCGGCT TCCGCAACAC CGGACGCCAT CGTAGCAAGG CAGTGCAGCG CTGGACGCAG GGCGCGGTCG AGCTGGTGAT CAATTGCGAG CCGGCGAGCT TTGCGCATTC GCATTACGTC ACGCACGGAC CCGGCGTCTG CGCGCTGGCA CTCGACGTCG ACAGCGCCGA CCGCGCCATG GCGCGCGCGC AGTCCTTGAA GACCCGCACC TTCGTGCAGC CGGTCGGTCC GGGAGAGCTG GAGATCCCGG CGATCCACGG CGTCGGCGGC AGTCTCTTGT ATTTCCTCGA CACGCACGGC CGGCATTGGG ACGTCGATTT CGAGGCTGTT ACCAGCGACG CGGGGGCCGA CCGGCTCGTT GCGGTCGATC ACATTGCGCA GTCCATGCCG CACGAGGAGA TGCTGTCCTG GCTGCTGTTC TACTCGGGGC TCATCGACTT CTCCCGCCTG CCGCAGATGG AGATCGCCGA TCCCAGGGGC CTCGTGCAGA GCCAGGCGAT CGTCAACGGC GATCGCAGCC TGCGCTTCAT CCTTAACGGC TCGACCGCGA CGCGGACGCT GTCGTCGCGC TTCATCTCCG AATTCTTCGG CTCCGGCGTC CAGCACATCG CTTTCTCCTG CGACGACATC TTCGCGGCCG TGGCCGAGAT GCGCGCCCGC GGCGCCGGCT TCCTCGACAT TCCCGACAAT TACTACGACG ATCTCGAAGC CAAATATGAT CTCGCGCCCG AGACCATCGC GGCGCTCCGC GCCAACGAGA TCCTCTACGA CCGGGACGGC GACGCGGAGT TCTTCCAGGT CTACACCCAT ATCTTCGAGG AGCGCTTCTT CTTCGAGATC GTGCAGCGGC GCAACTATCA GGGGTTTGGC GCAGCCAATG CCGCGATCCG TCTTGCGGCC CAGGCCCGCG AGGTGCGGCC CGACACGATG CCGCGACTGT AG
|
Protein sequence | MNTRSIATVS ISGALDEKLK AIAAAGFEAV EIFENDLIAF GLRPREIRQM CRDLGLSICA YQPFRDFEGM PEPQRARDFN RAERKFDLMQ ELGTDLLLIC SSVSPASLGG IDRAAADFHE LGERAAKRGL RVGYEALAWG RHVHDYRDAW EIVRRADHKA IGVILDSFHA LAPALPTDAI RAIPADKIFL VQLADAPKLE LDVLSWSRHF RSFPGQGDLP VSAFMEAVAA TGYSGPLSLE IFNDQFRAGS APRTALDGMR SLLLLQDDLA GTMPAASEPR FEPRVKSHGI GFVEFAVSED KAASLATLFR QLGFRNTGRH RSKAVQRWTQ GAVELVINCE PASFAHSHYV THGPGVCALA LDVDSADRAM ARAQSLKTRT FVQPVGPGEL EIPAIHGVGG SLLYFLDTHG RHWDVDFEAV TSDAGADRLV AVDHIAQSMP HEEMLSWLLF YSGLIDFSRL PQMEIADPRG LVQSQAIVNG DRSLRFILNG STATRTLSSR FISEFFGSGV QHIAFSCDDI FAAVAEMRAR GAGFLDIPDN YYDDLEAKYD LAPETIAALR ANEILYDRDG DAEFFQVYTH IFEERFFFEI VQRRNYQGFG AANAAIRLAA QAREVRPDTM PRL
|
| |