Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daud_1982 |
Symbol | |
ID | 6026454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | + |
Start bp | 2085301 |
End bp | 2086827 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641594803 |
Product | extracellular solute-binding protein |
Protein accession | YP_001718105 |
Protein GI | 169832123 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.204968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCGGC GCCTGCTTGC CCTGCTGGGG CTGGGTTTCA TCCTGACCGC CTTGTTCATT GTGGCCGGCC AACTCGAAGA TGGCGGCAAA AAGGAGCGGC TGACCTACGC CCTGGCCCGT TACCCCGCCA CCCTGGACCC AACGGCCGTT ACGGACGAGT CGGGAGCCGC CGTGCTCTTG AACCTCTACG AGGGCCTCGT GCGCTTCGAA CCAGGAGGCA CCGGGATTGA ACCCGCGCTG GCCCGGGACT GGAACGTATC ACCCGACGCC CGGACTTGGA CTTTTTATCT CCAGGAAGAC ATATCTTTCA CCGACGGCAC CCCGCTGGAC GCCGCGGCGG TAAGAGACGC GGTCGAACGG CAGCTCAACC CCGAAACCGC CGGACCATAC GCTTCCTTTG TATACGGGCC GGTGACGCGG ATCGAAACCA AGGGCCGCCA CACGGTCATT TTTCACCTGA AGCACCCGTA CGCCCCGTTC ATCAGGAACC TGGCAATGCT TCCGGCGGCG GTCGTCCGCC CTTCCCCCGA CCACGGCCTG CCCATCGGCA CCGGCCCTTT CGTCCCGTCC GCCATTGAGT CGGCCCGGAT CACTCTGAAA GCCAATCCCG CTTACCGGGA AGGGCCGCCG CACCTGAAGG AAGTCCTTTT CGTAGTCATT CCCGATCCGC ATGAACGGTG GCGGGCACTG GCCCAAGGCC GGGTGGACGT GGCGGAAAAC ACTGGGGCCG CCCTGCCGGC CACAGGACCG GATAGCCTAG TCATCGCCCG GACGCCCGGG CTGGACCTGA GTTACCTAGC ATTCTATACC AACAAAAAGC CCTTTGACAA TCCCGCCGTA CGGCGGGCGG CAAGCCTCGC CGTCAATCAG CAGGCCATTG TGGACTACCT CTTTCCGGAC CGGGCTGTGC CTGCTATCGG ACCCCTGCCC CCCGGTACCC TGGGTCACCA CCCCACCCTG GGCGCGGACG CTTACAACCT GGAGGAAGCC CGGCAGCTCC TGGACCAAGC GGGTTACAGC GGTGAGGAAA TCACGCTGAT CACCTACCAG GACCGGCGCC CCTACAACCC GGCGGGCGGG GAGAAACTGG CCCACCTTCT GGTTGAACAG CTCGCCCAGG CCGGTTTTAA GGTGCGGGTG GAGGCCTACC CTTGGGAGAT CTGCAAGCAC GCCATCCACC GCCAGGAGGG GCACGCTTTC GTCTTCGGCT GGGTCGGGGA TAACGGGGAC CCGGACAATT TCCTATACAC CCTGCTGGCC AGCGCGCAGA TCCAAACCGG CACCAACGCG GCACGCTACT CCAACCCGCA TGTCGACATG CTGCTCGGCC GGGCCCAGCA GGTGACCGAC GAAGCGCTGC GCGAACGCTT GTACCGCCAA GCCCAGGAGC TTATTGCCGC CGATGCTCCG TGGGTATTCC TGAACCATCG GCTCGAAACG GCGGCGCACC ACCCCACGGT GAAAAATCTG GTGGTGCAGC CCACCGGGGG CGCCTATCTG GCCCAGGTGC GCAAGGACGA CCAGTAA
|
Protein sequence | MHRRLLALLG LGFILTALFI VAGQLEDGGK KERLTYALAR YPATLDPTAV TDESGAAVLL NLYEGLVRFE PGGTGIEPAL ARDWNVSPDA RTWTFYLQED ISFTDGTPLD AAAVRDAVER QLNPETAGPY ASFVYGPVTR IETKGRHTVI FHLKHPYAPF IRNLAMLPAA VVRPSPDHGL PIGTGPFVPS AIESARITLK ANPAYREGPP HLKEVLFVVI PDPHERWRAL AQGRVDVAEN TGAALPATGP DSLVIARTPG LDLSYLAFYT NKKPFDNPAV RRAASLAVNQ QAIVDYLFPD RAVPAIGPLP PGTLGHHPTL GADAYNLEEA RQLLDQAGYS GEEITLITYQ DRRPYNPAGG EKLAHLLVEQ LAQAGFKVRV EAYPWEICKH AIHRQEGHAF VFGWVGDNGD PDNFLYTLLA SAQIQTGTNA ARYSNPHVDM LLGRAQQVTD EALRERLYRQ AQELIAADAP WVFLNHRLET AAHHPTVKNL VVQPTGGAYL AQVRKDDQ
|
| |