Gene Information |
Locus tag | Swit_3201 |
Symbol | |
ID | 5198205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 3518084 |
End bp | 3519943 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640582747 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001263686 |
Protein GI | 148556104 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.814938 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATA TTCCCGCGCG CACCGAGATG AAGGTGACGA CCGGTCCGAT CCGGGGATCG CGCAAGATCC ATGTCGAGGG GCCTCAGGGC GTCCGCGTCG CGATGCGCGA GATCGCGCTC GAGCCGTCGT CGGGCGAGCC GCCGGTCCGC GTCTATGACT GCTCCGGCCC CTATACCGAT CCCAACGCCC ATATCGACAT CATGGCCGGC CTGCCCGCGC TGCGCCGCGA CTGGATTCTC GGGCGCGGCG ACGTCGAGGA CTATGAGGGC CGCGCGATCA AGCCCGAGGA CAACGGCCTG AAGGGGCCGG ACCGGTCGGG CGGCGTCACC CCCTTCCCGA ACGTGGTGCG GCGCCCGCTG CGCGCGAAGG CAGGGCAGAA TGTCAGCCAG ATGCACTATG CCCGGCGCGG CATTATCACG CCCGAGATGG AATATGTCGC GATCCGCGAG AATGTCGGCC GCGCCGCGCT CAAGGAGAAG CTGCTGCGCG ACGGCGAGGA TTTCGGCGCG GCGATCCCCG ACTTCGTCAC CCCCGAATTC GTCCGCGACG AGGTGGCGCG CGGCCGGGCG ATCATCCCCA GCAACATCAA CCATCCCGAA TCGGAGCCGA TGGCGATCGG CCGCAACTTC CTGGTGAAGA TCAACGCCAA TATCGGCAAT TCGGCGGTCG CTTCCGACGT CGCGGCCGAG GTCGACAAGA TGGTCTGGTC GATCCGCTGG GGCGCCGACA CGGTGATGGA CCTGTCGACC GGCCGCAACA TCCACGACAC GCGCGAATGG ATCCTGCGCA ACTCGCCGGT CCCGATCGGC ACGGTGCCGA TCTACCAGGC GCTGGAGAAG GTCGGCGGCA TCGCCGAGGA CCTGACCTGG GAAATCTTCC GCGACACGCT GATCGAGCAG GCCGAGCAGG GCGTCGACTA TTTCACCATC CATGCCGGCG TCCGCCTGCC CTTCATCCCA ATGACGGCCA AGCGCGTCAC CGGCATCGTC AGCCGGGGCG GATCGATCAT GGCGAAATGG TGCCTGGCGC ACCATCGCGA GAGCTTCCTC TACGAGCGGT TCGACGAGAT CTGCGAGATC ATGAAGGCCT ATGACGTCGC CTTCTCGCTG GGCGACGGCC TGCGCCCCGG ATCGATCGCC GACGCCAATG ACGAGGCGCA GTTCAGCGAA TTGAAGACGC TGGGCGAGCT GACCCAGATC GCCTGGCAGC ACGACGTGCA GGTGATGATC GAGGGCCCCG GCCATGTGCC GATGCACAAG ATCAAGGCGA ACATGGACAA GCAGCTCGAA GCCTGCGGCG AGGCGCCCTT CTACACGCTC GGGCCGCTGA CCACCGATAT CGCGCCGGGC TATGATCACA TCACCTCGGC GATCGGCGCG GCGATGATCG GCTGGTTCGG CACGGCGATG CTCTGCTACG TCACGCCGAA GGAGCATCTG GGCCTGCCCG ACCGCGACGA CGTGAAGGTC GGCGTGGTCA CCTACAAGCT CGCCGCCCAC GCCGCCGACC TGGCGAAGGG CCACCCGGCC GCCAAGCTGC GCGACGACGC GCTCAGCCGC GCCCGCTTCG AGTTCCGCTG GCGCGACCAG TTCAACCTGT CGCTCGATCC CGACACGGCC GAGCAGTATC ACGACCAGAC GCTGCCGGCC GAAGGCGCCA AGACCGCGCA TTTCTGCTCG ATGTGCGGGC CGAAATTCTG CTCGATGAAG ATCACCCAGG AGGTCCGCGA CTTCGCCGCC ACCAGGAACG CCCCTGCCGA CCAGTTCATC GCGGCCGGCG ACGCCGAGGC GGGCATGCGC 
GCGATGAGCG ACGTGTTCCG CGAGAAGGGC GGGGAGATCT ATCTGCCGGC GGCGGAGTAG
|
Protein sequence | MADIPARTEM KVTTGPIRGS RKIHVEGPQG VRVAMREIAL EPSSGEPPVR VYDCSGPYTD PNAHIDIMAG LPALRRDWIL GRGDVEDYEG RAIKPEDNGL KGPDRSGGVT PFPNVVRRPL RAKAGQNVSQ MHYARRGIIT PEMEYVAIRE NVGRAALKEK LLRDGEDFGA AIPDFVTPEF VRDEVARGRA IIPSNINHPE SEPMAIGRNF LVKINANIGN SAVASDVAAE VDKMVWSIRW GADTVMDLST GRNIHDTREW ILRNSPVPIG TVPIYQALEK VGGIAEDLTW EIFRDTLIEQ AEQGVDYFTI HAGVRLPFIP MTAKRVTGIV SRGGSIMAKW CLAHHRESFL YERFDEICEI MKAYDVAFSL GDGLRPGSIA DANDEAQFSE LKTLGELTQI AWQHDVQVMI EGPGHVPMHK IKANMDKQLE ACGEAPFYTL GPLTTDIAPG YDHITSAIGA AMIGWFGTAM LCYVTPKEHL GLPDRDDVKV GVVTYKLAAH AADLAKGHPA AKLRDDALSR ARFEFRWRDQ FNLSLDPDTA EQYHDQTLPA EGAKTAHFCS MCGPKFCSMK ITQEVRDFAA TRNAPADQFI AAGDAEAGMR AMSDVFREKG GEIYLPAAE
|