Gene BBta_6113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_6113 
Symbol 
ID5152611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp6339546 
End bp6341417 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content67% 
IMG OID640560827 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001241946 
Protein GI148257361 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.471766 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACGC GCTCGATCGC GACGGTCTCC ATCTCAGGCG CGCTCGATGA GAAGCTCAAG 
GCGATTGCCG CGGCAGGCTT TGAAGCGGTC GAGATCTTCG AGAATGACCT GATTGCGTTC
GGGTTGCGCC CGCGCGAGAT CCGGCAGATG TGCCGCGACC TGGGGCTGTC GATCTGCGCC
TATCAGCCGT TCCGCGATTT CGAGGGCATG CCGGAGCCGC AGCGCGCGCG CGACTTCAAC
CGCGCCGAAC GCAAATTCGA CCTGATGCAG GAGCTCGGCA CGGATCTGCT GCTGATCTGC
AGCAGCGTGT CGCCGGCCTC GCTCGGCGGC ATCGACCGTG CTGCCGCCGA TTTCCACGAG
CTCGGCGAGC GCGCCGCCAA GCGCGGCCTG CGCGTCGGCT ACGAGGCGCT GGCCTGGGGC
CGCCACGTCC ATGACTATCG TGACGCCTGG GAGATCGTGC GCCGCGCCGA TCACAAGGCG
ATCGGCGTCA TTCTCGACAG CTTCCACGCG CTGGCACCGG CGCTGCCGAC CGACGCGATC
CGCGCCATCC CCGCCGACAA GATCTTTCTC GTGCAACTCG CTGACGCGCC GAAGCTCGAG
CTCGACGTGC TATCCTGGAG CCGGCATTTC CGCAGCTTCC CCGGGCAGGG CGACCTCCCG
GTTAGCGCGT TCATGGAGGC GGTGGCGGCG ACCGGCTATT CCGGGCCGCT GTCGCTCGAG
ATCTTCAACG ACCAGTTCCG CGCAGGTTCT GCGCCGCGTA CCGCGCTCGA CGGCATGCGT
TCGCTGCTGC TGCTGCAGGA TGATCTGGCC GGGACGATGC CGGCTGCGTC CGAGCCGCGC
TTCGAGCCCC GCGTCAAGAG CCACGGCATC GGCTTCGTCG AGTTCGCCGT CAGCGAGGAC
AAGGCGGCGT CGCTGGCGAC GTTGTTCCGC CAGCTCGGCT TCCGCAACAC CGGACGCCAT
CGTAGCAAGG CAGTGCAGCG CTGGACGCAG GGCGCGGTCG AGCTGGTGAT CAATTGCGAG
CCGGCGAGCT TTGCGCATTC GCATTACGTC ACGCACGGAC CCGGCGTCTG CGCGCTGGCA
CTCGACGTCG ACAGCGCCGA CCGCGCCATG GCGCGCGCGC AGTCCTTGAA GACCCGCACC
TTCGTGCAGC CGGTCGGTCC GGGAGAGCTG GAGATCCCGG CGATCCACGG CGTCGGCGGC
AGTCTCTTGT ATTTCCTCGA CACGCACGGC CGGCATTGGG ACGTCGATTT CGAGGCTGTT
ACCAGCGACG CGGGGGCCGA CCGGCTCGTT GCGGTCGATC ACATTGCGCA GTCCATGCCG
CACGAGGAGA TGCTGTCCTG GCTGCTGTTC TACTCGGGGC TCATCGACTT CTCCCGCCTG
CCGCAGATGG AGATCGCCGA TCCCAGGGGC CTCGTGCAGA GCCAGGCGAT CGTCAACGGC
GATCGCAGCC TGCGCTTCAT CCTTAACGGC TCGACCGCGA CGCGGACGCT GTCGTCGCGC
TTCATCTCCG AATTCTTCGG CTCCGGCGTC CAGCACATCG CTTTCTCCTG CGACGACATC
TTCGCGGCCG TGGCCGAGAT GCGCGCCCGC GGCGCCGGCT TCCTCGACAT TCCCGACAAT
TACTACGACG ATCTCGAAGC CAAATATGAT CTCGCGCCCG AGACCATCGC GGCGCTCCGC
GCCAACGAGA TCCTCTACGA CCGGGACGGC GACGCGGAGT TCTTCCAGGT CTACACCCAT
ATCTTCGAGG AGCGCTTCTT CTTCGAGATC GTGCAGCGGC GCAACTATCA GGGGTTTGGC
GCAGCCAATG CCGCGATCCG TCTTGCGGCC CAGGCCCGCG AGGTGCGGCC CGACACGATG
CCGCGACTGT AG
 
Protein sequence
MNTRSIATVS ISGALDEKLK AIAAAGFEAV EIFENDLIAF GLRPREIRQM CRDLGLSICA 
YQPFRDFEGM PEPQRARDFN RAERKFDLMQ ELGTDLLLIC SSVSPASLGG IDRAAADFHE
LGERAAKRGL RVGYEALAWG RHVHDYRDAW EIVRRADHKA IGVILDSFHA LAPALPTDAI
RAIPADKIFL VQLADAPKLE LDVLSWSRHF RSFPGQGDLP VSAFMEAVAA TGYSGPLSLE
IFNDQFRAGS APRTALDGMR SLLLLQDDLA GTMPAASEPR FEPRVKSHGI GFVEFAVSED
KAASLATLFR QLGFRNTGRH RSKAVQRWTQ GAVELVINCE PASFAHSHYV THGPGVCALA
LDVDSADRAM ARAQSLKTRT FVQPVGPGEL EIPAIHGVGG SLLYFLDTHG RHWDVDFEAV
TSDAGADRLV AVDHIAQSMP HEEMLSWLLF YSGLIDFSRL PQMEIADPRG LVQSQAIVNG
DRSLRFILNG STATRTLSSR FISEFFGSGV QHIAFSCDDI FAAVAEMRAR GAGFLDIPDN
YYDDLEAKYD LAPETIAALR ANEILYDRDG DAEFFQVYTH IFEERFFFEI VQRRNYQGFG
AANAAIRLAA QAREVRPDTM PRL