Gene Information |
Locus tag | Dde_2216 |
Symbol | |
ID | 3757226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | - |
Start bp | 2256015 |
End bp | 2257016 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637783106 |
Product | extracellular solute-binding protein |
Protein accession | YP_388708 |
Protein GI | 78357259 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | |
|
|
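The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2256015, 2257016)
print(length)                     # 1002
print(protein_length_aa(length))  # 333
```

Under those assumptions the result matches the Gene Length (1002 bp) and Protein Length (333 aa) rows.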
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.36837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACTGG CCGTATGTGC GGTCTTGCTT GGAGTGTCCG GCTTTGCTTC TGTATGTCTG GGGCAGACGT ACAGGCTGAC ATACAGCAGC TTTTTTCCGC CCTCGCATGT GCAGTCCGTA CTGGCGGAAG AGTGGGCGCG GGAAGTTGAA AAAAGAACAG CAGGCGCTGT CGTCATTGAT TTTTATCCTG CCGGAACCCT GACAGGGGCA AGGCAGGCCT ACGACGGCGT GGTGCAGGGG ATTTCTGACA TCGGGCTCTC CGCGCTGGCG TATTCCAGAG GACGTTTTCC GCTTATGGAA GCCGTGGACC TGCCGCTGGG GTATACCAGC GGGGCGCAGG CCACACGGGT TGCAAACAGC GTGTACAGTC ACTTTGTGCC GCGCGAACTG CAGGACGTGC ATGTGCTGTA TTTTCATGCC CATGGTCCCG GGCTGCTGCA TACGCGGCAA AAGGCGGTGC GCTCGCTGGA AGATATGCAG GGCTTGAAGC TGCGTGCCAC AGGAAATTCC GCCAGTGTGG TCAAAGCACT GGGCGGGACG CCCGTGGCCA TGTCGATGCC TGAATCGTAT CAGTCCATCC AGCGCGGCGT GGTTGACGGC GGCATGTATC CCGCCGAAAC AAACAAAGGC TGGAAAATGG CGGAAGTTGT TGATTACTGC ACGGAGGCCG TGCCCGTGGC GTATACGACA ACGTTTTTTG TGGTGATGAA TAAAGACCGG TGGGAATCTC TGCCCGGTGA GGTGCAGGAG ACCATCACCT GGATAAGCCG CGAGTGGGCA CCGCGCCATG GTGCGGCGTG GGACGAGAGC GACGCAGAAG GCAGGGCGTT TTTTGCAGCG CAGGGCAACA GCTTCATCAC GCTGGAAGAG GCCGAGATGG CACGCTGGAA ACAGGCTGTC GAGCCGGTGG TTCAGGAGTA TGTGCGGCAG GCCGCAAAGC GTGGTGTGGA TGCAGAGGGC GTGGTGGAGT TTATCCGCAA TGAGCTTGCA AGCGGCCGTT GA
|
Protein sequence | MLLAVCAVLL GVSGFASVCL GQTYRLTYSS FFPPSHVQSV LAEEWAREVE KRTAGAVVID FYPAGTLTGA RQAYDGVVQG ISDIGLSALA YSRGRFPLME AVDLPLGYTS GAQATRVANS VYSHFVPREL QDVHVLYFHA HGPGLLHTRQ KAVRSLEDMQ GLKLRATGNS ASVVKALGGT PVAMSMPESY QSIQRGVVDG GMYPAETNKG WKMAEVVDYC TEAVPVAYTT TFFVVMNKDR WESLPGEVQE TITWISREWA PRHGAAWDES DAEGRAFFAA QGNSFITLEE AEMARWKQAV EPVVQEYVRQ AAKRGVDAEG VVEFIRNELA SGR
|
| |
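The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes a GC fraction, and translates the gene sequence. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in which codons may act as start codons). The helper names are hypothetical, and the sketch assumes the sequence is already given in coding orientation, as it appears to be here since it begins with ATG and ends with TGA.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Tables 1 and 11 share these assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq_text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_text):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGTTACTGG CCGTATGTGC GGTCTTGCTT"))  # MLLAVCAVLL
```

The printed residues agree with the first block of the protein sequence. This sketch does not handle alternative start codons such as GTG or TTG, which table 11 reads as methionine at the initiation position; that does not matter for this gene, which starts with ATG.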