Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_6226 |
Symbol | thiC |
ID | 5149212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 6472979 |
End bp | 6474880 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640560938 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001242055 |
Protein GI | 148257470 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.642096 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCC GCTCCAATCC CGACACGACA CTGCCCGCCG TGACCACCGG TCCGCTGCCC TCCTCGCGCA AGATCTATGC GGCGCCGGAC TCCGCGCCCG ATTTGCGCGT GCCGCTGCGC GAGATCATCC TCTCCGAGGC CGCCGGCGAG CCGAACCTGC CGGTGTACGA CACGTCGGGC CCCTACACCG ATCCCTCCGT CACCATCGAC GTCAATGCCG GCCTGCCGCG CAACCGCACG GCCTGGGTGC TCGAACGCGG TGGCGTCGAG GAGTATGAGG GGCGCGACAT CAAGCCGGAG GACAACGGCA ATGTCGGCGC CGACAAGGCG GCGAAAGCCT TCATCGCGCA TCACAAGCCG CTGCGCGGCC TCGACGGTCA CAAGATCACC CAGCTCGAAT TCGCCCGCGC CGGCATCATC ACCAAGGAGA TGATCTACGT CGCCGAGCGC GAGAATCTTG GCCGCAAGCA GCAGCTGGAA CGCGCCGAAG CCGCGCTCGC CGACGGCGAG AGTTTTGGCG CCGAGGTCCC CGCCTTCATC ACGCCGGAGT TCGTGCGCTC CGAGATCGCG CGGGGACGGG CCATCATCCC CTGCAACATC AACCACGCCG AATTGGAGCC GATGATCATC GGCCGCAACT TTCTCACCAA GATCAACGCC AATATCGGCA ACTCGGCGGT GACCTCGTCC GTCGAGGAGG AAGTGGACAA GATGGTGTGG GCGATCCGCT GGGGCGCCGA CACGGTGATG GACCTCTCGA CCGGCCGCAA CATCCACACC ACCCGCGAAT GGATCCTGCG CAACTCGCCG GTGCCGATCG GCACGGTGCC GATCTACCAG GCGCTGGAGA AGTGCAACGG CGATCCGGTC AAGCTGACCT GGGAGCTCTA CAAGGACACG CTGATCGAGC AGTGCGAACA GGGCGTCGAC TACTTCACCA TCCATGCCGG CGTGCGGCTG CAATACATCC ACCTCACCGC GTCACGCGTC ACCGGCATCG TCTCGCGCGG CGGCTCGATC ATGGCCAAGT GGTGCCTCGC GCATCACAAG GAGAGCTTCC TCTACACGCA TTTCGACGAG ATCTGCGACA TCATGCGCAA GTATGACGTC TCGTTCTCGC TCGGCGACGG CCTGCGCCCC GGCTCGATCG CGGACGCTAA CGACCGCGCG CAGTTCGCGG AGCTGGAGAC GCTCGGCGAG CTCACCAAGA TCGCCTGGGA CAAGGGCTGC CAGGTCATGA TCGAGGGCCC CGGCCACGTG CCGATGCACA AGATCAAGAT CAACATGGAC AAGCAGCTCA AGGAGTGCGG CGAGGCGCCG TTCTATACGC TCGGGCCGCT GACGACGGAC ATCGCACCCG GCTACGACCA CATCACGTCA GGCATCGGCG CCGCCATGAT CGGCTGGTTC GGCTGCGCGA TGCTGTGCTA CGTGACGCCG AAGGAGCATC TCGGTCTGCC CGATCGCAAC GACGTCAAGA CCGGCGTCAT CACCTACAAG ATCGCCGCCC ACGCCGCCGA TCTCGCCAAG GGCCATCCCG CCGCGCAGCT GCGCGACGAT GCGCTGTCCC GCGCCCGCTT CGAGTTCCGC TGGACCGACC AGTTCAACCT CGGTCTCGAC CCGGACACGG CCAAGAGCTT CCACGACGAG ACCCTGCCCA AGGAAGCCCA CAAGGTCGCG CATTTCTGCT CGATGTGCGG CCCGAAATTC TGCTCGATGA AGATCACCCA GGACGTCCGC GACTACGCCG CCACGCTGAA CGATCCGACC ACCGTCGGCG TGACCATCAG CGGCACCATC GAAGACGGCA TGGCCCAGAT GAGCGCCAAG TTCAAGGAAA TGGGCGGCAG CGTGTATCTC GATGCTGACA AAGTGAAGGA GAGCAACAAG GCGTTGTCCT GA
|
Protein sequence | MNIRSNPDTT LPAVTTGPLP SSRKIYAAPD SAPDLRVPLR EIILSEAAGE PNLPVYDTSG PYTDPSVTID VNAGLPRNRT AWVLERGGVE EYEGRDIKPE DNGNVGADKA AKAFIAHHKP LRGLDGHKIT QLEFARAGII TKEMIYVAER ENLGRKQQLE RAEAALADGE SFGAEVPAFI TPEFVRSEIA RGRAIIPCNI NHAELEPMII GRNFLTKINA NIGNSAVTSS VEEEVDKMVW AIRWGADTVM DLSTGRNIHT TREWILRNSP VPIGTVPIYQ ALEKCNGDPV KLTWELYKDT LIEQCEQGVD YFTIHAGVRL QYIHLTASRV TGIVSRGGSI MAKWCLAHHK ESFLYTHFDE ICDIMRKYDV SFSLGDGLRP GSIADANDRA QFAELETLGE LTKIAWDKGC QVMIEGPGHV PMHKIKINMD KQLKECGEAP FYTLGPLTTD IAPGYDHITS GIGAAMIGWF GCAMLCYVTP KEHLGLPDRN DVKTGVITYK IAAHAADLAK GHPAAQLRDD ALSRARFEFR WTDQFNLGLD PDTAKSFHDE TLPKEAHKVA HFCSMCGPKF CSMKITQDVR DYAATLNDPT TVGVTISGTI EDGMAQMSAK FKEMGGSVYL DADKVKESNK ALS
|
| |