Gene Sde_3779 details


Gene Information

Locus tag: Sde_3779
Symbol:
ID: 3966834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4780243
End bp: 4781259
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 45%
IMG OID: 637922876
Product: arsenical pump membrane protein, putative
Protein accession: YP_529246
Protein GI: 90023419
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
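
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 4780243
END_BP = 4781259


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1017 338
```

Both results agree with the Gene Length and Protein Length fields of the record.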


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.567439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGTTGT TTGAACGTTA CTTAAGTGTT TGGGTAGGCC TGTGCATTGT AGCTGGTGTA 
GGCCTTGGCT ATGTAATGCC GAGTGCGTTT AGCGCAATTG CTCACTTAGA AGTGGCCCAC
GTAAACATTC CCGTGGCGAT TTTTATTTGG GTAATGATTT ACCCCATGAT GATACAAGTG
GATTTTGCAT CCATTAAAGA TATTGGCAAA AAACCTAAAG GTTTAGTGTT AACACTATTA
ATCAATTGGC TTATTAAACC GTTCACAATG GCGGCGTTGG GCTGGTTGTT TTTTAAAATA
CTGTTTGCCG ATTTAGTCGA CCCCGCCACC GCAAGTGAAT ATATAGCGGG TATGATTTTA
CTGGGTGTAG CGCCATGTAC CGCTATGGTA TTTGTATGGA GCCAATTAAC CAAAGGCGAT
GCAAATTATA CGCTGGTACA AGTATCGGTT AACGATGTGA TTATGATTTT TGCCTTTGCG
CCTTTGGCCG CGTTTTTATT AGGCGTAACC GATATTACTG TGCCGTGGGA AACGTTGCTG
CTATCGGTTT TACTCTATGT GGTATTGCCA CTGGTTGCAG GCATAGCTAC ACGCAAAGCA
CTGGATGCAG CAGATAATCA CACTCGCTTA AATAATTTTG TGGGCATGTT AAAGCCATGG
TCGATTGTGG GCTTGCTCGC AACCGTAGTG TTGCTGTTTG GTTTTCAAGC CAACACTATT
TTAAGTGAGC CTATGGCAAT AGTGCTTATC GCCATCCCTT TGCTTATTCA AACCTACGGC
ATTTTTGCAA TCGCTTACGC AGGCGCAAAA TGCTTAAAGC TGCCCCACAA TATTGCCGCA
CCGGCATGCA TGATTGGTAC ATCTAACTTT TTCGAACTGG CGGTAGCGGT GGCCATTTCA
TTGTTTGGTT TGCATTCTGG CGCAGCCTTG GCAACGGTAG TGGGCGTATT GGTAGAAGTG
CCAGTGATGT TAAGCCTGGT TGCTTTTGCC AACCGTACTC GTCATTGGTT TGATTAA
 
Protein sequence
MGLFERYLSV WVGLCIVAGV GLGYVMPSAF SAIAHLEVAH VNIPVAIFIW VMIYPMMIQV 
DFASIKDIGK KPKGLVLTLL INWLIKPFTM AALGWLFFKI LFADLVDPAT ASEYIAGMIL
LGVAPCTAMV FVWSQLTKGD ANYTLVQVSV NDVIMIFAFA PLAAFLLGVT DITVPWETLL
LSVLLYVVLP LVAGIATRKA LDAADNHTRL NNFVGMLKPW SIVGLLATVV LLFGFQANTI
LSEPMAIVLI AIPLLIQTYG IFAIAYAGAK CLKLPHNIAA PACMIGTSNF FELAVAVAIS
LFGLHSGAAL ATVVGVLVEV PVMLSLVAFA NRTRHWFD
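
The protein sequence can be checked against the gene sequence by translating the CDS. The sketch below assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the alternative start codons it permits, which does not matter here because the gene starts with ATG). The `translate` helper is a hypothetical name written for this example, and only the first 30 bases of the gene sequence are used.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
first_block = "ATGGGGTTGT TTGAACGTTA CTTAAGTGTT"
print(translate(first_block))  # MGLFERYLSV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would allow a comparison against all 338 residues.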