Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_2331 |
Symbol | |
ID | 3757342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | - |
Start bp | 2349438 |
End bp | 2351105 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637783222 |
Product | putative phenylpyruvate decarboxylase |
Protein accession | YP_388823 |
Protein GI | 78357374 |
COG category | [G] Carbohydrate transport and metabolism [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes |
TIGRFAM ID | [TIGR03394] indolepyruvate/phenylpyruvate decarboxylase, Azospirillum family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00138899 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTGA CCGAAGCGTT GCTGCATGCT CTCAAAGAGC GTGGCGCCAA AGAAGTGTAC GGCATTCCGG GCGACTTTGC CCTGCCGTAT TTCAAGATAA TCGAATCCAC GGGTATTCTG CCGCTTGTCA CGCTGAGCCA CGAGCCCGCC GTGGGGTTTG CCGCCGACGC ATCCGCCCGC TTCAGAGGCG GTCTGGGGGT GGCCGCAGTG ACATACGGAG CCGGCGCATT GAACATGGTT AACGCCATAG CGCAGGCATA TGCGGAAAAG TCGCCTGTGG TTGTCATTTC CGGCGCGCCG GGAACCCACG AAGGCGGACA TGGGCTGCTG CTGCACCATC AGGCGAAGCA TCTTGACTCG CAGTACCGCA TGTACAAAGA AGTCACCTGC GATCAGGCGG TGCTGGACGA TCCGGCCACT GCGCCTGAAA CCATAGCCCG TGTGCTGCAG AGCTGTATTG AACATTCCCG CCCTGTGTAT CTTGAGATTC CGCGTGATCT GCCTGCGGCA CCGTGTGCGC CTGTGCCTGC TTTTGCTGCT TCCGGCACGG ATGCGGACGC CGTGGACGCC TGCGCCCGCG ACGTGCTGCA GCGCGTGCAG AGCGCCGCGC TGCCCATGAT GATGGTGGGG GTTGAGGTGC GGCGTTACGG CATAGAAGAC AAAGTGGCTC TGCTGGCCGA GCGGCTGGGG GTGCCTGTGG TGACCAGCTT TATGGGACGC GGGTTGCTAG CTTCGGCCAG CTGTCTGGAA GGAACCTATC TGGGCATGGC CGGTGACGAG ACACTCAGCG AGGCTGTGGA ATCTTCCGAC GGGCTTCTGC TGCTGGGTGT TATCATGTCG GATACCAATT TCGGCGTGTC CGGAAGCCAG CTGGACCGCA GGCGTGTGAT GCATGCGGAG GACAGACAGG TGCGTGTCGG CTACCATGTG TATCATGACA TTCCGCTGGG AGCGCTGGTG GACCGTCTGC TGGCATTGCT GGATGACGGA CAGGAAGCTG ACATGGCTCT TTCGGGCTGC GGCGGTTCTT CTGACGACAG GGCATTGCAG GGGGCGGGAC TCGCTGTGAA TCCGCGGGGG TTCATGGCAG ATGCCGCAGA TATCACTCCC ACGGATATAG CCGCGCTGCT TAATGATTTT TTTGCCTCGC ACGGAGCAAT GCCCGTTGCC AGCGATATAG GCGACTGCCT GTTCACCATG CTGAGCGTGG ATGCCATTCC CATGGTGGCA CCGGGGTATT ATGCCAGCAT GGGCTTCGGG GTGCCGGCGG GCATGGGGCT GCAGATAACG ACAAAGGAGC GTTCGCTCAT TCTGGTGGGC GACGGCGCTT TTCAGATGAC CGGTATGGAA CTGGGCAACT GCACGCGGCT GGGTATTGAT CCCATCGTGC TGGTGTTTAA TAATTCGTCG TGGGAAATGC TGCATGTCTT CCAGCCCGAG ACAGCATACA GCAATCTGGG CGAATGGGAT TTTGCCGCGG TGGCTGACGG TCTGGGCGGA CGGGGGCACA GGGCCGCCAC CCGTGCCGAG CTTGCACGTG CGCTGACACA GGCCCATGCC GAACGCGGAC GTTTTCAGCT GATTGACGCC CGTCTGGCTC CGCATAGTCT GTCCCCCACG CTGGCCCGTT TTGTGGAAGG TGTGCGCAGA TTCAGTTCGG GGCAATAA
|
Protein sequence | MNLTEALLHA LKERGAKEVY GIPGDFALPY FKIIESTGIL PLVTLSHEPA VGFAADASAR FRGGLGVAAV TYGAGALNMV NAIAQAYAEK SPVVVISGAP GTHEGGHGLL LHHQAKHLDS QYRMYKEVTC DQAVLDDPAT APETIARVLQ SCIEHSRPVY LEIPRDLPAA PCAPVPAFAA SGTDADAVDA CARDVLQRVQ SAALPMMMVG VEVRRYGIED KVALLAERLG VPVVTSFMGR GLLASASCLE GTYLGMAGDE TLSEAVESSD GLLLLGVIMS DTNFGVSGSQ LDRRRVMHAE DRQVRVGYHV YHDIPLGALV DRLLALLDDG QEADMALSGC GGSSDDRALQ GAGLAVNPRG FMADAADITP TDIAALLNDF FASHGAMPVA SDIGDCLFTM LSVDAIPMVA PGYYASMGFG VPAGMGLQIT TKERSLILVG DGAFQMTGME LGNCTRLGID PIVLVFNNSS WEMLHVFQPE TAYSNLGEWD FAAVADGLGG RGHRAATRAE LARALTQAHA ERGRFQLIDA RLAPHSLSPT LARFVEGVRR FSSGQ
|
| |