Gene Tbd_1426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_1426 
Symbol 
ID3672267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp1515546 
End bp1516646 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content66% 
IMG OID637710111 
ProductACR3 family arsenite transporter 
Protein accessionYP_315184 
Protein GI74317444 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.653925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCGC AGTGTGAAGT CACCGCCAAG CGCGCGGCCG CCATGGGCGG CCCGCCGGCG 
CCGATGAGCG TGTTCGAGCG CTGGCTCACC CTGTGGGTCG CGCTGTGCAT CGTCGCCGGC
GTGGCACTCG GGCAACTGTT CCCGGCGCCA TTCCAGGCGC TCGGGCGGAT GGAGGTGGCG
CAGGTCAATC TGCCGGTCGG CCTGTTGATC TGGATCATGA TCATCCCGAT GCTGATGAAG
ATCGACTTCG GCGCGCTGCA TCAGGTGAAA TCGCACTGGC GCGGCATCGG CGTCACGCTC
TTCGTCAACT GGGCGGTGAA GCCGTTCTCG ATGGCACTCC TGGCGTGGAT CTTCATCCGC
CATTTGTTCG CGCCCTGGCT GCCCGCCGAG CAGCTCGACA GCTACGTCGC CGGCCTGATC
CTGCTCGCGG CCGCGCCGTG CACGGCGATG GTGTTCGTGT GGAGCCGCCT GACCGGCGGC
GATCCATATT TCACGCTGTC GCAGGTGGCG CTCAACGACA CCATCATGAT CTTCGCCTTC
GCGCCGATCA TCGGGCTGCT CCTGGGCCTT TCCGCGATCG TGGTGCCGTG GGACACGCTC
ATGATTTCGG TCGCGCTTTA TATCGTGCTT CCGGTGATCC TCGCGCAGGT CTGGCGCAAG
CGGCTGCTGA AGCGCGGGCA GGCGGTGTTC GACCGGGTGA TGGCGCAGCT CGGCCAGGCT
TCGATCCTCG CGCTGCTGGC GACGCTGGTG CTGCTGTTCG CCTTTCAGGG CGAGCAGATC
CTCGCGCAGC CGTTGATCAT CGCGTTGCTC GCAGTGCCGA TCCTGATCCA GGTCTTCTTC
AACTCGGGCT TGGCCTACTG GCTGAACAGA AAAGTGGGCG AGAAGCACGC CGTCGCCTGC
CCGTCGGCGC TGATCGGCGC GTCGAACTTC TTCGAGCTGG CGGTGGCGGC GGCGATCGCG
CTGTTCGGCT TCGAGTCGGG CGCGGCGCTG GCGACCGTGG TCGGCGTGCT GATCGAGGTG
CCGGTGATGC TGCTGGTGGT GAAGCTCGTC AACCGCAGCA AGCGCTGGTA CGAGCGCGGC
CTGCCAGCGG GACGGGCCTG A
 
Protein sequence
MSAQCEVTAK RAAAMGGPPA PMSVFERWLT LWVALCIVAG VALGQLFPAP FQALGRMEVA 
QVNLPVGLLI WIMIIPMLMK IDFGALHQVK SHWRGIGVTL FVNWAVKPFS MALLAWIFIR
HLFAPWLPAE QLDSYVAGLI LLAAAPCTAM VFVWSRLTGG DPYFTLSQVA LNDTIMIFAF
APIIGLLLGL SAIVVPWDTL MISVALYIVL PVILAQVWRK RLLKRGQAVF DRVMAQLGQA
SILALLATLV LLFAFQGEQI LAQPLIIALL AVPILIQVFF NSGLAYWLNR KVGEKHAVAC
PSALIGASNF FELAVAAAIA LFGFESGAAL ATVVGVLIEV PVMLLVVKLV NRSKRWYERG
LPAGRA