Gene Dtox_2133 details

Gene Information

Locus tag: Dtox_2133
Symbol:
ID: 8429115
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 2310288
End bp: 2311535
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 45%
IMG OID: 645034453
Product: Aluminium resistance family protein
Protein accession: YP_003191584
Protein GI: 258515362
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4100] Cystathionine beta-lyase family protein involved in aluminum resistance
TIGRFAM ID:
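
The length fields above are consistent with the coordinates. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2310288
end_bp = 2311535

# Inclusive coordinates: both endpoints count, hence the +1.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the terminal stop codon is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1248 415
```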


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.821926
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0213182
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTACATAG AGCTTGAGCA ATTGGAAGAG CTGGCGCTGG AAGCAGAAAA TGAGTTGTTG 
CCTGTGTACC GTGAATTGGA TAAAACTTCG TGGATTAATC ACAGCAAAGT TTTGGCTGCT
TTTCAAGAAG AAAAGGTCAG CGATTTTCAT TTAAAGAGTT CCTCAGGTTA CGGTTATAAC
GATATGGGCC GGGAGATTTT AGAGAAGTTA TATGCACGCA TCTTTGGTGC TGAAGCAGCT
TTAGTACGCA GTCAAATCGT CTCAGGTACT CATGCAATGG CCATTTGCCT ATTTGGTATC
CTGCGTCCTG GAGATGAGCT GGTTTCAGCA ACCGGGACTC CTTATGATAC ACTTGAAGAA
ATTATAGGTA TCAGGGGCAG CGGGGGCGGT TCTTTAAAAG AATTTGGCAT CGCTTATCGC
CAGGTTGAAT TATTGCCGGA TGGAAAACTG GATTATGAAA AATTGAAGGA AGCTGTTAGC
TCACAAACTA AATGTATTAT GCTGCAAAGG TCCAGAGGTT ATTCGGAACG TCCTGCTTTA
ACGGTAGCAC AAATAGGTGA ATTATGCAGT TTTGTTAAGC AGAACTGGCC CTCCATAATT
GTTTTTGTGG ACAACTGCTA CGGAGAGTTT GTAGAGACAT TAGAACCTTG TGATGTAGGA
GCTGATTTGG TTGCCGGTTC ATTGATTAAA AATCCTGGTG GGGGGTTGGC TCCTACAGGT
GGTTATATTG TGGGTCGCAG TGAGCTTGTT GAATTGGCCG CCAACCGTTG GACAGCCCCG
GGCATCGGAG CTGAGGTCGG TCCTTCACCT GATTTTCAGC GACTATTATA TCAAGGACTT
TTTATTTCTC CCCATATTGT TAACGAATCA CTTAAAGGAG CAGTGTTTAC AGCCAAACTT
TTTGAACGAC TACGGTTTAG AGTTTTTCCT GCCGCTGAGG ATTATAGGAC AGATATTATT
CAAGCCGTGG AACTAGGTTC GCCGGAAAAG GTAATTGCTT TTTGTCGGGG AATTCAAAAA
GCTTCACCGG TAGATGCTCA TGTTATTCCG GAACCGTGGG ACATGCCTGG TTATGGTGAT
CAGGTAATTA TGGCTGCCGG CACTTTTGTT CAGGGTGCTT CTTTAGAACT GACAGCTGAC
GCGCCGATTC GCCGGCCTTT CATAGTTTAC CTGCAAGGAG GTTTATCCAG GCAATATGTA
AAGTTGGGTG TGCTGTCCGC GGCCAAGTTT GTGCTTGGGT TAGGTTAA
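
The GC content field can be recomputed from the gene sequence. A minimal sketch, assuming GC content means the fraction of G and C bases over the full sequence after the spacing between 10-base groups is removed (the function name `gc_fraction` is hypothetical):

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (the spaces and line breaks used to group bases
    in the record) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


# First row of the gene sequence; paste the full sequence
# to compare against the GC content reported in the record.
first_row = "TTGTACATAG AGCTTGAGCA ATTGGAAGAG CTGGCGCTGG AAGCAGAAAA TGAGTTGTTG"
print(f"{gc_fraction(first_row):.0%}")
```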
 
Protein sequence
MYIELEQLEE LALEAENELL PVYRELDKTS WINHSKVLAA FQEEKVSDFH LKSSSGYGYN 
DMGREILEKL YARIFGAEAA LVRSQIVSGT HAMAICLFGI LRPGDELVSA TGTPYDTLEE
IIGIRGSGGG SLKEFGIAYR QVELLPDGKL DYEKLKEAVS SQTKCIMLQR SRGYSERPAL
TVAQIGELCS FVKQNWPSII VFVDNCYGEF VETLEPCDVG ADLVAGSLIK NPGGGLAPTG
GYIVGRSELV ELAANRWTAP GIGAEVGPSP DFQRLLYQGL FISPHIVNES LKGAVFTAKL
FERLRFRVFP AAEDYRTDII QAVELGSPEK VIAFCRGIQK ASPVDAHVIP EPWDMPGYGD
QVIMAAGTFV QGASLELTAD APIRRPFIVY LQGGLSRQYV KLGVLSAAKF VLGLG
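
The protein sequence can be reproduced from the gene sequence with translation table 11. A minimal sketch, assuming the gene sequence is given in the coding orientation and that the initiator codon is read as methionine, which is why the TTG at the start of this gene corresponds to M rather than L. The helper names are hypothetical; the codon ordering follows the NCBI genetic code listing.

```python
from itertools import product

# Amino acids for table 11 in NCBI order (bases cycle T, C, A, G,
# third position fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence (60 bases, 20 codons).
first_row = "TTGTACATAG AGCTTGAGCA ATTGGAAGAG CTGGCGCTGG AAGCAGAAAA TGAGTTGTTG"
print(translate_cds(first_row))  # prints: MYIELEQLEELALEAENELL
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence should yield the complete protein, with translation ending at the terminal TAA.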