Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daro_3921 |
Symbol | |
ID | 3567652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dechloromonas aromatica RCB |
Kingdom | Bacteria |
Replicon accession | NC_007298 |
Strand | - |
Start bp | 4215811 |
End bp | 4217727 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637682395 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_287119 |
Protein GI | 71909532 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCA CCGAACAATT CCTCGCCGCC AACGCCCACG TCGACGAGGC TGCAGTCCAG CCGCTGCCCA ATTCCCGCAA AATCTACGTT GAAGGTTCGC GCCCGGATAT TCGGGTGCCG ATGCGCGAAG TGTCGCAGGA CGACACGCCG ACCGCCTTCG GTGGAGAAAA GAACCCGCCG ATCTACGTCT ATGACTGCTC CGGCCCCTAT TCCGACCCGG CCGCCAAGAT CGACATCCGT TCCGGCCTGC CGGCGCTGCG TGCCCAGTGG ATTGCCGAGC GCGGCGATGT CGAGGCGCTG GCCGATTTGA GTTCCGAGTT CGGCCGCCAG CGTGCCGCCG ACCCAAAACT CGACGAACTG CGCTTCCCCG GCCTGCACCG CAAGCCGCTG CGCGCCAAGG CCGGTCAGAA CGTTTCGCAG ATGCACTACG CCCGCCGGGG CATCATCACG CCGGAGATGG AATACGTCGC CATCCGCGAG AACAACAACC GCCGCGCTTA CATTGAAAGC CTGAAGGCCA CCGGCCCGAT GGGTAACCGG ATGGCCGACA TTCTCGGCCG CCAGCACAAG GGCCAGGATT TCGGCGCCAG CATTCCGGAA GAAATCACCC CGGAATTCGT CCGCAGCGAA ATCGCCCGCG GCCGCGCCAT CATCCCGAAC AACATCAACC ACCCGGAAAG CGAGCCGATG ATCATCGGCC GCAATTTCCT GGTCAAGATC AATGCCAACA TCGGCAACTC GGCGCTCGGC TCCTCGATTC AGGAAGAAGT CGAAAAGATG ACCTGGTCGA TCCGCTGGGG CGGCGACACG GTGATGGACC TGTCCACCGG CAAGAACATT CACGAAACGC GTGAATGGAT CATCCGTAAC AGCCCGGTGC CAATCGGCAC GGTGCCGATC TATCAGGCGC TGGAAAAGGT TAACGGCAAG GCCGAAGACC TGACCTGGGA AATCTTCCGC GACACACTGA TCGAACAGGC CGAACAGGGC GTCGACTACT TCACCATCCA CGCCGGCGTC TTGCTCCGCT ATGTGCCGAT GACCGCCAAC CGCCTGACCG GCATCGTTTC CCGCGGTGGC TCGATCATGG CCAAGTGGTG TCTGGCCCAT CACAAGGAAA GCTTCCTCTA CACGCATTTC GAGGAAATCT GCGAAATCAT GAAGGCCTAC GACGTCGCCT TCAGCCTCGG CGACGGCCTG CGTCCCGGTT CTATCTACGA TGCCAACGAC GAAGCGCAAC TCGGCGAATT GGAGACGCTG GGCGAACTGA CCAAAATTGC CTGGAAGCAC GACGTTCAGG TCATCATCGA AGGCCCGGGT CATGTGCCGA TGCACATGAT CAAGGAAAAC ATGGACCTCC AGCTCAAGCA TTGCGACGAA GCTCCGTTCT ACACCCTCGG TCCCTTGACC ACCGATATTG CACCGGGCTA CGACCACATC ACCAGCGGCA TCGGTGCCGC GATGATCGGC TGGTACGGCA CTGCCATGCT CTGTTACGTC ACGCCGAAAG AGCACCTTGG CCTGCCCGAC AAGGATGACG TCAAGGAAGG CATCATCACC TACAAGCTCG CCGCCCACGC CGCCGACCTC GCCAAGGGCC ATCCCGGCGC GCAGATCCGC GACAACGCGC TTTCCAAGGC ACGCTTCGAA TTCCGCTGGG ATGACCAGTT CAACCTCGGC CTCGACCCGG ACAAGGCGCG CGAATTCCAC GACGAAACCC TGCCCAAGGA ATCAGCCAAG GTCGCTCACT TCTGCTCCAT GTGCGGCCCG CACTTCTGTT CGATGAAGAT CACCCAGGAA GTCCGGGAGT TCGCTGCACA GCAAGGGCTG GATGAAGCGG CTGCCCTGGA GAAGGGGATG GAAGTGAAAT CGGTTGAGTT TGTGAAGGCC GGCGCTGAGG TTTATAGCAA GATCTAA
|
Protein sequence | MNATEQFLAA NAHVDEAAVQ PLPNSRKIYV EGSRPDIRVP MREVSQDDTP TAFGGEKNPP IYVYDCSGPY SDPAAKIDIR SGLPALRAQW IAERGDVEAL ADLSSEFGRQ RAADPKLDEL RFPGLHRKPL RAKAGQNVSQ MHYARRGIIT PEMEYVAIRE NNNRRAYIES LKATGPMGNR MADILGRQHK GQDFGASIPE EITPEFVRSE IARGRAIIPN NINHPESEPM IIGRNFLVKI NANIGNSALG SSIQEEVEKM TWSIRWGGDT VMDLSTGKNI HETREWIIRN SPVPIGTVPI YQALEKVNGK AEDLTWEIFR DTLIEQAEQG VDYFTIHAGV LLRYVPMTAN RLTGIVSRGG SIMAKWCLAH HKESFLYTHF EEICEIMKAY DVAFSLGDGL RPGSIYDAND EAQLGELETL GELTKIAWKH DVQVIIEGPG HVPMHMIKEN MDLQLKHCDE APFYTLGPLT TDIAPGYDHI TSGIGAAMIG WYGTAMLCYV TPKEHLGLPD KDDVKEGIIT YKLAAHAADL AKGHPGAQIR DNALSKARFE FRWDDQFNLG LDPDKAREFH DETLPKESAK VAHFCSMCGP HFCSMKITQE VREFAAQQGL DEAAALEKGM EVKSVEFVKA GAEVYSKI
|
| |