Gene Dtox_0450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0450 
Symbol 
ID8427385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp466023 
End bp467582 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content42% 
IMG OID645032826 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003190004 
Protein GI258513782 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.15126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAA GGAAAATCAT GTTTATCGTA GTTTGTATAC TGATGACCAT CAGCGTCCTT 
CTCACATCAG GTTGTGGTGT AAAAAAAGAA AATAAGGACA GTAATAAAAA ATTGGTAGTA
GGAGAAATGT GGAAAATCGA CAGTATTGAT CCGTCAACTA GTGGCACTTT GACGACTGAA
AAAGCCATGA TCACAGAAAC ATTGGTAGGA GTAAAGGAGA ATTTTGAGTT AAAACCCGGT
CTTGCAACTG AATGGAAAAG AATAGATGCA AACACATGGA GTATTGTTCT TCGCCAAGGG
GTCAAATTCC ATGACGGCAC ACCGATGGAT GGAGAAGCAG TGAAATGGTC GCTCATGAGA
GCTATTGCTG TTAACCCGCA GGTTCAAACA TTTACAAAAA TAAAATCAAT AGAAGTAGAG
AACGATCATA CATTAAAAAT TACAACAACA GCTCCAACCG GCGATTTCCC TGCTTCGCTG
CATTATATGG GCGCATCTGT TATCGCGCCA AGCTCGGTTG ATAAAGCCGG CAAACTTATC
AAACCTATCG GTACGGGACC TTTTATGTTA GAACAATTTG AAGCATCAAC AGGCGACATG
GAGATGGCAA AGTATAAGGA CTACTGGGGA ACTCCCGCCA AGCTGGAAAA ATTGGAAATA
CGTCCTTTGC CGGATCCCAA TACCCGTGCT CTGGCCTTGG AAAAAGGGGA AATAGATTTT
ACCTGTGATC CACCTTATAA TGAACTGGAG AGATTGGGCA AGGAGAAAGG ACTTAAAGTC
GAACTCAATC CTACCGCCAG GACATATATT GTGGAAATGA ACCTGAAAAA AGAACCGTTC
AACGATGTAA GAGTCAGAAA AGCTTTGAGC TATGCTATTG ACAGGGAAAG TATTACTAAA
CATGTGCTTT TCGGATGCGG AACTCCTGCC AAGGGGCCAT TCATGCCGGG AATGGCGTGG
ACAAACGAAA ATTTAAAAGG TTATCCCTAT TCTCCCGGCA AGGCAAAATT ACTATTGGAA
GAAGCGGGTT GGAAAGATAC TGATGGAGAC GGAATCATAG ATAAAAACGG CCAACCGCTG
AAAATAACCC TTATGACTTA TCCGCAGAGA CCGGGCCTTC CTCCAATGGC TCAAGCTTTA
CAGGATCAAT TCAAACAAGC AGGGATTGAT TTGAAGATTG AGATTATGGA AAATTCCGCC
ATGAGCCAAG TTGCTTCAAC AGGCAAATGG GATATGAAAA TGAGTGCTTT TGCGACCGCT
ATGATTCCTA CTCCCAGCTA TCATTTACAA GTACTGTATT ATTCTGAAAA CAACAAATTA
ATTGGCTACA ATAATGCAAA AGTTGACCAA TTGATCGATG AATGTGTTGC TGTAGATGAC
CAGCAGAAAA AGTATGAGCT TTCCAAACAG GTACAGCAAA TTCTGGAAGA TGAAGTGCCT
GTACTACCAA TTGCCTATTA CGGTGTGGCC GCTGTCATGA ACAGCAAAAT CGAGAACTTT
GTATTTAATC CTACAGCTCA TGATTACATG CTTACAACAG AAGTGGGGAT TAAGGAGTAG
 
Protein sequence
MIKRKIMFIV VCILMTISVL LTSGCGVKKE NKDSNKKLVV GEMWKIDSID PSTSGTLTTE 
KAMITETLVG VKENFELKPG LATEWKRIDA NTWSIVLRQG VKFHDGTPMD GEAVKWSLMR
AIAVNPQVQT FTKIKSIEVE NDHTLKITTT APTGDFPASL HYMGASVIAP SSVDKAGKLI
KPIGTGPFML EQFEASTGDM EMAKYKDYWG TPAKLEKLEI RPLPDPNTRA LALEKGEIDF
TCDPPYNELE RLGKEKGLKV ELNPTARTYI VEMNLKKEPF NDVRVRKALS YAIDRESITK
HVLFGCGTPA KGPFMPGMAW TNENLKGYPY SPGKAKLLLE EAGWKDTDGD GIIDKNGQPL
KITLMTYPQR PGLPPMAQAL QDQFKQAGID LKIEIMENSA MSQVASTGKW DMKMSAFATA
MIPTPSYHLQ VLYYSENNKL IGYNNAKVDQ LIDECVAVDD QQKKYELSKQ VQQILEDEVP
VLPIAYYGVA AVMNSKIENF VFNPTAHDYM LTTEVGIKE