Gene Daud_1982 details


Gene Information

Locus tag: Daud_1982
Symbol:
ID: 6026454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 2085301
End bp: 2086827
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 65%
IMG OID: 641594803
Product: extracellular solute-binding protein
Protein accession: YP_001718105
Protein GI: 169832123
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
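
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon (the record does not state either convention). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2085301
end_bp = 2086827
gene_length_bp = 1527
protein_length_aa = 508

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence ends with a single stop codon,
# which is counted in the gene length but not in the protein length.
codons = gene_length_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)      # coordinate span vs. gene length
print(gene_length_bp % 3 == 0)        # whole number of codons
print(residues == protein_length_aa)  # codons minus stop vs. protein length
```

Under these assumptions all three checks hold for this record: the span is 1527 bp, which is 509 codons, giving 508 residues once the stop codon is excluded.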


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.204968
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACCGGC GCCTGCTTGC CCTGCTGGGG CTGGGTTTCA TCCTGACCGC CTTGTTCATT 
GTGGCCGGCC AACTCGAAGA TGGCGGCAAA AAGGAGCGGC TGACCTACGC CCTGGCCCGT
TACCCCGCCA CCCTGGACCC AACGGCCGTT ACGGACGAGT CGGGAGCCGC CGTGCTCTTG
AACCTCTACG AGGGCCTCGT GCGCTTCGAA CCAGGAGGCA CCGGGATTGA ACCCGCGCTG
GCCCGGGACT GGAACGTATC ACCCGACGCC CGGACTTGGA CTTTTTATCT CCAGGAAGAC
ATATCTTTCA CCGACGGCAC CCCGCTGGAC GCCGCGGCGG TAAGAGACGC GGTCGAACGG
CAGCTCAACC CCGAAACCGC CGGACCATAC GCTTCCTTTG TATACGGGCC GGTGACGCGG
ATCGAAACCA AGGGCCGCCA CACGGTCATT TTTCACCTGA AGCACCCGTA CGCCCCGTTC
ATCAGGAACC TGGCAATGCT TCCGGCGGCG GTCGTCCGCC CTTCCCCCGA CCACGGCCTG
CCCATCGGCA CCGGCCCTTT CGTCCCGTCC GCCATTGAGT CGGCCCGGAT CACTCTGAAA
GCCAATCCCG CTTACCGGGA AGGGCCGCCG CACCTGAAGG AAGTCCTTTT CGTAGTCATT
CCCGATCCGC ATGAACGGTG GCGGGCACTG GCCCAAGGCC GGGTGGACGT GGCGGAAAAC
ACTGGGGCCG CCCTGCCGGC CACAGGACCG GATAGCCTAG TCATCGCCCG GACGCCCGGG
CTGGACCTGA GTTACCTAGC ATTCTATACC AACAAAAAGC CCTTTGACAA TCCCGCCGTA
CGGCGGGCGG CAAGCCTCGC CGTCAATCAG CAGGCCATTG TGGACTACCT CTTTCCGGAC
CGGGCTGTGC CTGCTATCGG ACCCCTGCCC CCCGGTACCC TGGGTCACCA CCCCACCCTG
GGCGCGGACG CTTACAACCT GGAGGAAGCC CGGCAGCTCC TGGACCAAGC GGGTTACAGC
GGTGAGGAAA TCACGCTGAT CACCTACCAG GACCGGCGCC CCTACAACCC GGCGGGCGGG
GAGAAACTGG CCCACCTTCT GGTTGAACAG CTCGCCCAGG CCGGTTTTAA GGTGCGGGTG
GAGGCCTACC CTTGGGAGAT CTGCAAGCAC GCCATCCACC GCCAGGAGGG GCACGCTTTC
GTCTTCGGCT GGGTCGGGGA TAACGGGGAC CCGGACAATT TCCTATACAC CCTGCTGGCC
AGCGCGCAGA TCCAAACCGG CACCAACGCG GCACGCTACT CCAACCCGCA TGTCGACATG
CTGCTCGGCC GGGCCCAGCA GGTGACCGAC GAAGCGCTGC GCGAACGCTT GTACCGCCAA
GCCCAGGAGC TTATTGCCGC CGATGCTCCG TGGGTATTCC TGAACCATCG GCTCGAAACG
GCGGCGCACC ACCCCACGGT GAAAAATCTG GTGGTGCAGC CCACCGGGGG CGCCTATCTG
GCCCAGGTGC GCAAGGACGA CCAGTAA
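
The GC content field reported for this gene can in principle be recomputed from the sequence text. A minimal sketch in Python follows; `gc_percent` is a hypothetical helper name, and the sample string is only the first line of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_percent(seq_text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base groups, line breaks) is ignored.
    """
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, used here only as a short sample.
sample = "ATGCACCGGC GCCTGCTTGC CCTGCTGGGG CTGGGTTTCA TCCTGACCGC CTTGTTCATT"
print(round(gc_percent(sample), 1))
```

Passing the full gene sequence block to the same function would give the figure to compare against the reported GC content.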
 
Protein sequence
MHRRLLALLG LGFILTALFI VAGQLEDGGK KERLTYALAR YPATLDPTAV TDESGAAVLL 
NLYEGLVRFE PGGTGIEPAL ARDWNVSPDA RTWTFYLQED ISFTDGTPLD AAAVRDAVER
QLNPETAGPY ASFVYGPVTR IETKGRHTVI FHLKHPYAPF IRNLAMLPAA VVRPSPDHGL
PIGTGPFVPS AIESARITLK ANPAYREGPP HLKEVLFVVI PDPHERWRAL AQGRVDVAEN
TGAALPATGP DSLVIARTPG LDLSYLAFYT NKKPFDNPAV RRAASLAVNQ QAIVDYLFPD
RAVPAIGPLP PGTLGHHPTL GADAYNLEEA RQLLDQAGYS GEEITLITYQ DRRPYNPAGG
EKLAHLLVEQ LAQAGFKVRV EAYPWEICKH AIHRQEGHAF VFGWVGDNGD PDNFLYTLLA
SAQIQTGTNA ARYSNPHVDM LLGRAQQVTD EALRERLYRQ AQELIAADAP WVFLNHRLET
AAHHPTVKNL VVQPTGGAYL AQVRKDDQ
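
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The sketch below is a minimal Python example under stated assumptions: it uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in which codons may act as start codons, and that case is not handled here). `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon.

    Whitespace between base groups is ignored. Alternative start codons
    are NOT converted to methionine in this sketch.
    """
    bases = "".join(seq_text.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence (both in frame).
print(translate("ATGCACCGGC GCCTGCTTGC CCTGCTGGGG"))  # MHRRLLALLG
print(translate("GCCCAGGTGC GCAAGGACGA CCAGTAA"))     # AQVRKDDQ
```

The two fragments reproduce the first ten and last eight residues of the protein sequence listed above, and the final TAA codon is the stop.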