Gene Information |
Locus tag | Saro_1824 |
Symbol | |
ID | 3918384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1926312 |
End bp | 1928192 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444566 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_497098 |
Protein GI | 87199841 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
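The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the record uses 1-based, inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1926312, 1928192)
print(length)                     # 1881
print(protein_length_aa(length))  # 626
```

Under these assumptions, both values agree with the Gene Length (1881 bp) and Protein Length (626 aa) fields.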
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.484584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGACA TCAACTCCAA GCTGGAAATC GGCGTCACCA CCGGGCCGAT CCGCGGCAGC CGCAAGATCC ACGTGGAAAG CGCACGCTTT CCCGGCTTGA CGGTGGCGAT GCGCGAAATC CAGCTCGAGC CTTCGAGCGG CGAGCCGCCG GTACGCGTCT ACGATACGTC CGGTCCGTAC ACCGACCCCA AAGTGACCAT CGACATCGCG GCGGGCCTGC CCACTCTGCG TCGCGACTGG ATCATGGCGC GCGGCGACGT GGAAGAATAC GACGCGCGCG AAGTGAAGCC CGAGGACAAT GGCCTCAAGG GTCCGGATCG TTCTGCCGGG GTGCCGCCGT TCCCCAACGT CGTGAAGCGC CCTTTGCGTG CAAAGGCGGG CCAGAACGTC AGCCAGATGC ACTATGCCCG CCGCGGGATC ATCACGCCCG AGATGGAATA CGTGGCGATC CGCGAGAACC TTGGCCGCAA GCAGGCCAAG GAAGCGATGA TCCGTGATGG GCAGGACTGG GGTGCGTCGA TCCCCGATTA CGTGACGCCC GAATTCGTGC GCGACGAAGT GGCGCGTGGG CGGGCGATCA TCCCGTCGAA CATCAACCAC CCTGAAAGCG AACCGATGGC GATCGGCCGC AACTTCCTGG TGAAGATCAA CGCCAACATC GGCAATTCGG CTGTGGCATC CGACGTCGCG AGCGAAGTCG ACAAGATGGT CTGGTCGATC CGCTGGGGTG CGGATACCGT GATGGACCTT TCGACGGGGC GGAACATCCA CGACACCCGC GAATGGATCC TGCGCAATTC GCCGGTGCCG ATCGGCACGG TGCCGATCTA CCAGGCGCTC GAAAAAGTCG GCGGCGTGGC CGAGGACCTG ACCTGGGAAG TGTTCCGCGA CACGCTGATC GAACAGGCCG AGCAGGGCGT GGACTACTTC ACCATCCATG CCGGCGTGCG CCTTCCCTAC ATCCCGCTTG CCGCCAAGCG CATGACCGGG ATCGTCAGCC GTGGCGGTTC GATCATGGCC AAGTGGTGCC TTGCGCATCA CAAGGAGTCG TTCCTTTACG AGAACTTCGA CGAGATCACC GAGATCATGA AGGCCTATGA CGTGGCCTAT TCGCTGGGCG ATGGCCTGCG TCCCGGATCG ATCTACGACG CCAACGACGA AGCACAGTTC GCCGAACTCT ACACGCTGGG CGAACTGACC AAGCGTGCCT GGGAACAGGA CGTGCAGGTG ATGATCGAGG GCCCCGGCCA CGTGCCGATG CACAAGATCA AGGAGAACAT GACGAAGCAG CTCGAGGCGT GCGGCGAAGC GCCGTTCTAC ACGCTCGGGC CGCTCGTCAC CGACATCGCG CCGGGATACG ACCACATCAC CAGCGGCATC GGTGCGGCAC AGATCGGCTG GTACGGCACG GCGATGCTCT GTTACGTCAC GCCCAAGGAG CATCTCGGCC TGCCGGATCG CGATGACGTG AAGGTCGGCG TGGTGACCTA CAAGCTGGCA GCCCACGCCG CCGACCTCGC CAAGGGCCAC CCTGCGGCAC AGGCGCGCGA TGACGCGCTG AGCAAGGCGC GCTTCGAGTT CCGCTGGCGC GACCAGTTCA ACCTGTCGCT CGATCCGGAA ACCGCCGAGC AGTACCACGA CCAGACGCTT CCGGCGGAAG GCGCAAAGAC CGCGCACTTC TGCTCGATGT GCGGTCCGAA GTTCTGCTCG ATGAAGATCA GCCAGGAAGT TCGCGACTTT GCCGCCAAGC AGAATGCCGG CATCGAGACG TTTGTCGCCA ACGAGGCCGA AGCCGAGGCC 
GGCATGAAGG CGATGAGCGA CAAGTACGAC GAGATGGGCC GCGAACTGTA CATCGGCGCG GGCGGGCGCG AGCACGACTG A
|
Protein sequence | MADINSKLEI GVTTGPIRGS RKIHVESARF PGLTVAMREI QLEPSSGEPP VRVYDTSGPY TDPKVTIDIA AGLPTLRRDW IMARGDVEEY DAREVKPEDN GLKGPDRSAG VPPFPNVVKR PLRAKAGQNV SQMHYARRGI ITPEMEYVAI RENLGRKQAK EAMIRDGQDW GASIPDYVTP EFVRDEVARG RAIIPSNINH PESEPMAIGR NFLVKINANI GNSAVASDVA SEVDKMVWSI RWGADTVMDL STGRNIHDTR EWILRNSPVP IGTVPIYQAL EKVGGVAEDL TWEVFRDTLI EQAEQGVDYF TIHAGVRLPY IPLAAKRMTG IVSRGGSIMA KWCLAHHKES FLYENFDEIT EIMKAYDVAY SLGDGLRPGS IYDANDEAQF AELYTLGELT KRAWEQDVQV MIEGPGHVPM HKIKENMTKQ LEACGEAPFY TLGPLVTDIA PGYDHITSGI GAAQIGWYGT AMLCYVTPKE HLGLPDRDDV KVGVVTYKLA AHAADLAKGH PAAQARDDAL SKARFEFRWR DQFNLSLDPE TAEQYHDQTL PAEGAKTAHF CSMCGPKFCS MKISQEVRDF AAKQNAGIET FVANEAEAEA GMKAMSDKYD EMGRELYIGA GGREHD
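The sequences above are shown in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned and its GC content computed is given below; the helper names are hypothetical, and it assumes that whitespace is the only formatting to strip.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
first_blocks = "ATGGCCGACA TCAACTCCAA"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))
```

Applied to the full Gene sequence field, `gc_percent` could be compared against the GC content value reported in the Gene Information table.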
|