Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_0797 |
Symbol | |
ID | 3755753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | - |
Start bp | 816685 |
End bp | 817800 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637781662 |
Product | hypothetical protein |
Protein accession | YP_387293 |
Protein GI | 78355844 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.106816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCA CTCTGACCCC GTTTGACGAA CCGCGCCAAG TGCTGGACAT CTGCGCCAAA GTGCTGGACG GCAGCCGCAT AAACGAAGCC GAAGCGGCAA CCCTGTACGA CAGGGCAAAC CTGTTCACGC TGGGACAGCT GGCCCATGCC GTGCGCATGG CAAAGCACCC CACAGCCGTT GTCACCTATG TGGCCGACCG CAACATCAAT TATTCCAACA TCTGCGTGTG TGCCTGCCGG TTCTGCGCTT TTTACGCCCC GCCCGGACAC GAAGACGGCT ATGTGCTGAC CCGCGAAGAA CTGGGCCGCA AAATTGAAGA AACGCTGGAA CTGGGCGGCA CGCAGATTCT GCTGCAGGGC GGGCACCACC CCGACCTGCC CCTTGCGTTT TACGAAGACA TGATCCGCTG GATTGCGGAA ACATACCCCG CCATCCATAT CCATGCTTTT TCACCGCCTG AAATTGTCTT TTTTTCCGAA CTGGAAGGCG TAACCATAGC CGAAGTCATC GAAAGACTGC GCGCCGCAGG GCTGCATTCC ATTCCCGGCG GCGGCGCGGA AATTCTGGTG GACGAAGTGC GCGCACGCGT GGCTCCCAAC AAATGCAGCG CCTCGCGCTG GCTTGCCGTA ATGGAAGAGG CGCACTATCA GGGGCTGCGC ACCACCGCCA CCATGATGTT CGGACACGAA GAAGAACCCC GCCACCGGCT TGAACATCTC TTCGCTCTGC GCGAGGTACA GGACAGAACC GGCGGCTTCA CCGCCTTCAT CCCGTGGACA TTCCAGCCCG GCAACACCAA CATCCAGCGC GACACCGAGC CTTCGCCCGC CTATCTGCGC ATGCTGGCCA CATCACGCAT CGTGCTGGAC AACTTTGACA ATGTGCAGGC TTCATGGGTC ACCATGGGCC CGCAGGTGGC GCAGCTGGCA CTGCACTTCG GGGCAAACGA CTTCGGCTCG CTGATGATTG AAGAAAACGT GGTGGCCGCC GCAGGTGTAA GCTTCCGCAT GTCGCGGCAG GAAATCCACA ACGTCATCCG CGCTGCGGGA TTTGTGCCCC GCCAGCGCAC CATGGATTAC ACCTACGTGG AAAACAGGCC GGAGGGCGAC GCATGA
|
Protein sequence | MNATLTPFDE PRQVLDICAK VLDGSRINEA EAATLYDRAN LFTLGQLAHA VRMAKHPTAV VTYVADRNIN YSNICVCACR FCAFYAPPGH EDGYVLTREE LGRKIEETLE LGGTQILLQG GHHPDLPLAF YEDMIRWIAE TYPAIHIHAF SPPEIVFFSE LEGVTIAEVI ERLRAAGLHS IPGGGAEILV DEVRARVAPN KCSASRWLAV MEEAHYQGLR TTATMMFGHE EEPRHRLEHL FALREVQDRT GGFTAFIPWT FQPGNTNIQR DTEPSPAYLR MLATSRIVLD NFDNVQASWV TMGPQVAQLA LHFGANDFGS LMIEENVVAA AGVSFRMSRQ EIHNVIRAAG FVPRQRTMDY TYVENRPEGD A
|
| |