Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_2427 |
Symbol | |
ID | 5695275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 2930646 |
End bp | 2931926 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641265033 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001530308 |
Protein GI | 158522438 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00630295 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAGG TGGCCAGGGC AAAACAGGGC GTGGTAACCG AACAAATGGC GGCCGTTGCC AAAAGCGAGG GCCTTTCACC CGAGGCGGTG CGGGATGGCG TGGCCGCGGG CCGCATCGTG ATTCCCGGCA ATGTTAACCG CCGGTTTGCC CCGGTGGGCA TCGGCTCCGG GCTGCGCACC AAGGTGAACG CCAACATCGG CACCTCTCCG GAGCATCATG ATGTTGCCGA AGAGGAGCGC AAGCTTCAGA CCGCCGTTGC CGCCGGCGCC CACAGCGTGA TGGACCTTTC CACCGGCGGT GACCTTTTTG CGGTACGGAA AATGGTGCTG GAAAAATCCC CGGTGATGGT GGGGGCCGTG CCCATCTACG AGGTGGCGGC CCGGCTCAGC GCTGAATCGC GGGCCTTTTA TGAAATGACG CCGGACATGC TGTTTGACGC TATTGAGCGG CAGTGCGCCG AAGGCCTGGA CTACATAACG GTCCACTGCG GGGTGACCCG GCAGGCCGCG GCCCTGGCCG GGGCCGACAG GCGGGTGACC GGCATTGTCA GCCGGGGCGG TTCCCTGCTG GCGGCCTGGA TGCATGATCA CCGGAAGGAA AATCCCCTGT ATGAAGAGTT TGATCGGCTG CTCGGGATTG CCGCGGCCTA TGACGTGACC CTGAGCCTGG GAGACGGCCT GCGCCCCGGC GCCGTGGCCG ATGCATCGGA TGGGGCCCAG CTTCAGGAAC TGGTGGTGCT GGGAGACCTT GCCCGGCGGG CCCGGGACGC CGACGTTCAG GTCATGATTG AGGGGCCGGG CCATGTGCCC ATTGACCAGA TCGGTGCCAA CGTTCAAATG CAGAAAGCCC TGTGCGGCGG GGCCCCTTTT TACGTTCTGG GGCCCCTGAC CACGGACTGC GCGCCGGGGT ATGATCACAT CACCGGCGCC ATCGGCGGCG CCGTGGCAGC GGCTGCCGGC GCCGATTTTC TCTGCTATGT AACCCCGTCG GAACATCTCT GTCTGCCGGA CATCGACGAT GTGCGCGTGG GCGTGATTGC CGCTCGAATC GCGGCCCACT CCGGAGACAT TGCCAAAAAG GTGCCCGGCG CCCTGGACCG CGATATGCAA ATGTCGGCCT GCCGCAAGGC ACTGGACTGG CGCGGCATGT ACCAGGCCGC CATTGACCCC TGCCTGCCCC GACAGCGGCG GGAGGCAAGC CATCCTGAGG AAGAATCGGT GTGCACCATG TGCGGGGAGC TCTGCGCCGT CAAGACCCAT AACCGGATGA CAAATCCGTA G
|
Protein sequence | MTQVARAKQG VVTEQMAAVA KSEGLSPEAV RDGVAAGRIV IPGNVNRRFA PVGIGSGLRT KVNANIGTSP EHHDVAEEER KLQTAVAAGA HSVMDLSTGG DLFAVRKMVL EKSPVMVGAV PIYEVAARLS AESRAFYEMT PDMLFDAIER QCAEGLDYIT VHCGVTRQAA ALAGADRRVT GIVSRGGSLL AAWMHDHRKE NPLYEEFDRL LGIAAAYDVT LSLGDGLRPG AVADASDGAQ LQELVVLGDL ARRARDADVQ VMIEGPGHVP IDQIGANVQM QKALCGGAPF YVLGPLTTDC APGYDHITGA IGGAVAAAAG ADFLCYVTPS EHLCLPDIDD VRVGVIAARI AAHSGDIAKK VPGALDRDMQ MSACRKALDW RGMYQAAIDP CLPRQRREAS HPEEESVCTM CGELCAVKTH NRMTNP
|
| |