Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6395 |
Symbol | |
ID | 6983467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | + |
Start bp | 42222 |
End bp | 44048 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643399393 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002284149 |
Protein GI | 209552234 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0500237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATTG CCGTAGAGAA TCTCACCCCG ACCGTCACCA CCGGCCCGCT TCCGGCATCG CGGAAGATCC ATATTCCGGG AGATATCCAT GGCGATATCC GCGTGCCGAT GCGCGAAATC AGCGTCCATC CGACCGCCGG CGAACCGCCT GTCGTCGTCT ATGATTCCTC CGGCCCTTAC ACGATCGAAG GTGCGGAGAT CCGCATCGAG CAGGGCCTAC CCCAACTGCG CCGCGACTGG GTGCTGGCCC GCGGCGATGT CGAAGCCTAT GACGGCCGCC ATGTCCGCCC CGAAGATAAC GGCTTTGTCA GCGGCGACCG GCTGACGCCG GAATTTCCGG GGCGGCGCCG GCCGCTGCGC GCCAAGGACG GCAAAGCCGT CACCCAGCTC GCCTATGCAC GGGCCGGCGT CATCACGCCG GAGATGGAGT TCGTGGCCAT TCGCGAGAAT CTCGGCCGCA AGGCGAAGGC CGAAGCCCTG GTGCGTGACG GCGAGAGCTT CGGCGCCGAT ATTCCCGATC ACGTGACGCC GGAATTCGTC CGCCGGGAAG TCGCCGCCGG CCGGGCGATC ATCCCGGCCA ACATCAACCA TCCCGAAAGC GAACCGATGA TCATCGGCCG GAACTTTCTG GTGAAGATCA ACGCCAATAT CGGCAACTCC GCCGTCACCT CATCGATGGC TGAGGAAGTC GAGAAGATGG TCTGGGCTGC CCGCTGGGGC GCCGATACGG TCATGGATCT GTCGACCGGC CGCAACATCC ACAATATCCG CGAATGGATC ATCCGCAATT CGCCGCTGCC GATCGGCACG GTGCCGCTCT ACCAGGCGCT GGAAAAGGTT GGCGGCATCG CCGAAGACCT CACCTGGGAG ATCTATCGCG ATACGCTGAT CGAGCAGGCC GAACAGGGCG TCGACTATTT CACCATCCAT GCCGGTGTCC GGCTGCATTA CATCCCACTC ACCGTCAATC GCGTCACCGG CATCGTCTCG CGCGGCGGTT CGATCATGGC CAAGTGGTGT CTGCATCATC ACCGCGAAAG CTTCCTCTAC GAGCATTTCG AGGAAATCTG CGATATCTGC CGGGCCTATG ACGTCTCCTT CTCGCTCGGC GACGGCCTTC GCCCCGGCTC GATCGCCGAT GCCAACGATG CGGCGCAGTT TGCCGAACTC GAAACGCTGG GCGAGTTGAC GAAAATCGCC TGGGCCAGGG ATTGCCAGGT GATGATCGAG GGACCTGGCC ATGTGCCGAT GCACAAGATC AAGGAAAACA TGGACAAGCA GCTCGCCGTC TGCGGCGAGG CGCCCTTCTA CACGCTGGGG CCGCTGACCA CCGATATCGC GCCCGGCTAC GACCATATCA CCTCCGGCAT CGGCGCGGCG ATGATCGGCT GGTTCGGCAC GGCCATGCTC TGCTACGTCA CCCCGAAAGA ACATCTTGGC CTGCCCGATC GCAACGACGT CAAGACCGGC GTCATCACCT ACAAGATCGC CGCCCATGCC GCCGACCTTG CCAAGGGGCA CCCGGCCGCC CGCCTGCGCG ACGACGCCCT GTCGCGGGCG CGCTTCGAAT TCCGTTGGGA AGACCAGTTC AACCTGTCGC TCGATCCCGA CACCGCCCGC AGCTTCCATG ACGAGACCCT GCCGAAGGAA GCGCACAAAG TGGCGCATTT CTGCTCGATG TGCGGCCCGA AATTCTGCTC GATGCGCATA TCGCACGACA TCCGCGCCGA AGCACAAAAG GAAGGGCTGG ATGCGATGGC GGCGAAATAC CGGGAGGGCG GCGATCTCTA TATGCCGATC GATACCCACG CCAACGGGGC GGAATGA
|
Protein sequence | MTIAVENLTP TVTTGPLPAS RKIHIPGDIH GDIRVPMREI SVHPTAGEPP VVVYDSSGPY TIEGAEIRIE QGLPQLRRDW VLARGDVEAY DGRHVRPEDN GFVSGDRLTP EFPGRRRPLR AKDGKAVTQL AYARAGVITP EMEFVAIREN LGRKAKAEAL VRDGESFGAD IPDHVTPEFV RREVAAGRAI IPANINHPES EPMIIGRNFL VKINANIGNS AVTSSMAEEV EKMVWAARWG ADTVMDLSTG RNIHNIREWI IRNSPLPIGT VPLYQALEKV GGIAEDLTWE IYRDTLIEQA EQGVDYFTIH AGVRLHYIPL TVNRVTGIVS RGGSIMAKWC LHHHRESFLY EHFEEICDIC RAYDVSFSLG DGLRPGSIAD ANDAAQFAEL ETLGELTKIA WARDCQVMIE GPGHVPMHKI KENMDKQLAV CGEAPFYTLG PLTTDIAPGY DHITSGIGAA MIGWFGTAML CYVTPKEHLG LPDRNDVKTG VITYKIAAHA ADLAKGHPAA RLRDDALSRA RFEFRWEDQF NLSLDPDTAR SFHDETLPKE AHKVAHFCSM CGPKFCSMRI SHDIRAEAQK EGLDAMAAKY REGGDLYMPI DTHANGAE
|
| |