Gene Tbis_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbis_1107 
Symbol 
ID9167597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobispora bispora DSM 43833 
KingdomBacteria 
Replicon accessionNC_014165 
Strand
Start bp1256122 
End bp1257321 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content70% 
IMG OID 
Producthomogentisate 12-dioxygenase 
Protein accessionYP_003651722 
Protein GI296269090 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.141602 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTACT ACCGCCGGGT GGGCGACGTC CCGCCGAAGC GCCACACGCA GCACCGGGAT 
GAGACCGGGC GGCTCTACTA CGAGGAGCTC ATGGGCGAGG AGGGCTTCTC CTCCGACTCC
TCGCTCCTCT ACCACCGGCA CCTGCCCTCG GCGATCGTCG GCTTCGAGCC GTGGGAGCCG
CCGGACCACA CCACCACGCC GAACCACCCG CTGCGCCCCC GGCACCTGCG GCTCCACGCC
CTCTTCCCGG GCGACTCCTG GCGGGACGCC GACGTGGTGA CCGGGCGCCG GATGATCCTC
GGCAACGCGG ACGTGCGCAT CTTCTACGTG GCGGCGGGCA AGGAGTCACC GCTGTACCGC
AACGCGACCG GCGACGAGCT CGTCTACATC GAGTCCGGTG AGGCCGTCGT CGAGACCGTG
TTCGGCCCGC TCCGCGCCAA GACCGGCGAC TACGTGGTCA TGCCGGCCTC CACCATCCAC
CGCTGGCTGC CCCAGGGCGG CGAGCCGCTC CGCGCCTACA TCATCGAGGC CTCCGGCCAC
GTCGCGCCGC CGAAGCGCTA CCTTTCCCGG TACGGCCAGT TCCTCGAGCA CGCGCCGTAC
TGCGAGCGGG ACCTGCACGG GCCCGAGGAG GTGCTCTGCG TCGACGGCAC CGACGTCGAG
GTCCTGGTCA AGCACCGCGG CCCGGGCGGC ATCGCCGGCA CCAGGTTCGT CTTCGAGCGC
CACCCGTTCG ACGTCGTCGG CTGGGACGGC TGCCTGTACC CCTACACCTT CAGCATCTTC
GACTTCGAGC CGATCACCGG GCGCGTCCAC CAGCCGCCGC CGGTGCACCA GGTCTTCGAG
GGGCACAACT TCGTCGTGTG CAACTTCGTG CCCCGCAAGG TCGACTACCA CCCGCAGGCC
ATCCCGGTGC CGTACTACCA CTCGAACGTC GACTCCGACG AGGTGATGTT CTACTGCGGC
GGGAACTACG AGGCGCGGAA GGGCTCCGGG ATCGGCCAGG GCTCGGTCTC GCTCCACCCC
GCCGGCCACA CCCACGGCCC GCAGCCCGGC GGGTACGAGC GGAGCATCGG CGTGGAGTTC
TTCGAGGAGT ACGCCGTCAT GGTCGACACC TTCCGCCCGC TCGAGCTCGG CGAGGCCGCG
CTCGCCTGCG ACGTCGACGG CTACCAGTTC AGCTGGGCCG CGCAGAGGCA GGGGAAGTGA
 
Protein sequence
MAYYRRVGDV PPKRHTQHRD ETGRLYYEEL MGEEGFSSDS SLLYHRHLPS AIVGFEPWEP 
PDHTTTPNHP LRPRHLRLHA LFPGDSWRDA DVVTGRRMIL GNADVRIFYV AAGKESPLYR
NATGDELVYI ESGEAVVETV FGPLRAKTGD YVVMPASTIH RWLPQGGEPL RAYIIEASGH
VAPPKRYLSR YGQFLEHAPY CERDLHGPEE VLCVDGTDVE VLVKHRGPGG IAGTRFVFER
HPFDVVGWDG CLYPYTFSIF DFEPITGRVH QPPPVHQVFE GHNFVVCNFV PRKVDYHPQA
IPVPYYHSNV DSDEVMFYCG GNYEARKGSG IGQGSVSLHP AGHTHGPQPG GYERSIGVEF
FEEYAVMVDT FRPLELGEAA LACDVDGYQF SWAAQRQGK