Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_2045 |
Symbol | |
ID | 3757053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 2087402 |
End bp | 2089048 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637782933 |
Product | hypothetical protein |
Protein accession | YP_388537 |
Protein GI | 78357088 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCTCC CCCTGCCCAC ACCACAGGAA ATGAACCGGT GGGATCAGGC TGCCGTTGAC GAAGCGGGCA TGCCTGCCGA GATGCTGATG GAAAACGCCG CCCGCGAAGC TTTCAGAGTG CTGTGCGGCT GCTTTGAAAA ATCCTCGGCA ACGGGCGCGC ATACGGCGGA AGACCCTGCA TGGTGCCTTG CTGAAAAGCG CATCCTCTGT TTCATGGGTG GAGGAAACAA CGGCGGCGAT ACCGCCGCAC TGGCCAGACT TCTTTGCGGT GCCGGTGCCA CCGTGCTTGT GGTGCACACG CGCCCGCTTG AAGATTTCCG GCAGGAGGCG GCATGGCACA TAGAGCGTGC ACTTCAGGCG GGAGTGCTTT TTGTACAGTA TGACGCATCT TGTCCCGCAG TTATCCCCGC TGTCCTTTCC ACCAAGGAAT TATCAAACTG GAACGACGAC GCCTGCCCGG TGCCCGTCAT GCCGGACATC GTGGTTGACG GTCTGCTGGG AACGGGACTG CGGGGCGAAG TGTCAGATTC GCTGCTGGCA CTTATCCGGC ATATCAATGC GCTGGCACGG ACCGCATTTG TCTTTTCACT GGATATTCCC AGCGGTATGG ACGGACTGAC GGGGCTGCCC TCTCCGGAGT GCGTCTGTGC CCGCTGCACC GTCACATTTC AGGCTCCGAA GACAGGACTG CTTCACCCCG AGGCAAAGGA ATATACCGGC AGGCTGCACA TACGGGACAT CGGCATTCCT CCCGCTATCC AGAACAGATA TCCCACGCGA CTTCGCAGGG CCACATGTGC TGTTGCCGCA CTGCCCCCCC CGCCCTCGGC GGCCATGCAC AAGGGCACGG CAGGCCGTGT GCTGGTCATC GGCGGGTCCG CAGGGCTTTG CGGTGCTCCG CTGCTGGCGG CTGCCGGAGC CTTGCGCGCC GGTGCCGGAC TCGTCACGGT GGCCGTTCCT GCCCGCCTTG AAGCCTGTGT CAGACATGGC AATCCGGACA TCATGACATT GCCGCTTGCG TGCGGCGACG ACTGGGCCGC TTTTGACGCA CGCATGCTGG AAGGGCTGGA TGAGCGCTAC GACGCAGTCA TTGTGGGCAA CGGCATGGGG CGCTCGGCTC ACGCCGGCGA GGCGCTTGCG GAAATTTTGA AAACCCCCAG GCCTCCCTCC GTCATCGATG CGGACGCACT TTTCCATTTA CGCCACCCTA CGCAGATGCT TGCATTGATG CGTGAAACAG ATATTCTTAC ACCTCATCCG GGCGAGATGG CTTTTCTGAC GGGCCTGACC ATAGAGCAGG TACAGGCAGA CAGGCTTGCC GCGCTGGAAA TGCTGACGCG GCGCACAGCA GCAACCTGTA TTCTGAAGGG CGCAGGCACT CTGGTGGGCA GCACACAGTC ATTTACAGCG TTTATCGACG CCGGAGGGCC GTCACTGGCG GTGGGCGGCT CCGGCGATGT GCTTGCCGGT ATATGCGCCG CATTTGCGGC ACAGGGCCTT GGGGCATTCA ATGCCGCCGC CGCTGCGGCA TTCATCCACG GAAAAGCCGG AGAACTCCTG TTGCAGGAAT TTCCGTTGCG CGGTACTACC GCATCTGAAA TAGCTGATAC CATTCCCCGT ACCAGAAAGG AGCTTTATCA TGCTTAG
|
Protein sequence | MFLPLPTPQE MNRWDQAAVD EAGMPAEMLM ENAAREAFRV LCGCFEKSSA TGAHTAEDPA WCLAEKRILC FMGGGNNGGD TAALARLLCG AGATVLVVHT RPLEDFRQEA AWHIERALQA GVLFVQYDAS CPAVIPAVLS TKELSNWNDD ACPVPVMPDI VVDGLLGTGL RGEVSDSLLA LIRHINALAR TAFVFSLDIP SGMDGLTGLP SPECVCARCT VTFQAPKTGL LHPEAKEYTG RLHIRDIGIP PAIQNRYPTR LRRATCAVAA LPPPPSAAMH KGTAGRVLVI GGSAGLCGAP LLAAAGALRA GAGLVTVAVP ARLEACVRHG NPDIMTLPLA CGDDWAAFDA RMLEGLDERY DAVIVGNGMG RSAHAGEALA EILKTPRPPS VIDADALFHL RHPTQMLALM RETDILTPHP GEMAFLTGLT IEQVQADRLA ALEMLTRRTA ATCILKGAGT LVGSTQSFTA FIDAGGPSLA VGGSGDVLAG ICAAFAAQGL GAFNAAAAAA FIHGKAGELL LQEFPLRGTT ASEIADTIPR TRKELYHA
|
| |