Gene GSU1411 details


Gene Information

Locus tag: GSU1411
Symbol:
ID: 2685976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1549047
End bp: 1550042
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 62%
IMG OID: 637126085
Product: sodium/bile acid symporter family protein
Protein accession: NP_952463
Protein GI: 39996512
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID:
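
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1549047
END_BP = 1550042


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 996
print(protein_length(length_bp))  # 331
```

Both results agree with the Gene Length and Protein Length fields.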


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.618786
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGCAAC TGCTGAGCAG ACTCAACAAG AACCTGGTGC TGACCATTCC CGCCATGATG 
GCCGCGGGCT TCGTCTTCGG TCTCGTGACT GAGACCGGCT TTCTCAAGGA GCTGATCATC
CCCTTCACCT TTCTCATGGT CTACCCCATG ATGGTGACCC TTAAGCTGAA GAAGGTACTG
GAGGGAGGTG ACGGCAAGGC CCAGCTGTTC ACCCAGTTCA TCAACTTCGC GGTGGTGCCC
TTTGTCGCCT TCGGGCTGGG AAGGCTTTTC TTTGCTGACC GCCCCTACAT GGCCCTGGGC
CTGCTGCTTG CGGCCCTGGT CCCCACCAGC GGCATGACCA TCTCCTGGAC CGGCTTTGCC
AGGGGGAACC TCGAAGCGGC AGTCAAGATG ACCGTCGTGG GCCTCATCCT GGGCTCGATC
GCAACCCCCT TCTACGTGCA AGCCCTCATG GGCGCCCATG TGGAGGTGGA TATGACCGGC
ATCATGAAGC AAATCGCCGT CATCGTTTTC CTGCCCATGG CGGTCGGGTA TGCGACCCAG
CGCTATCTGA TAGCGAAGCA CGGCCAGAAA GGATTCCAGG AACAGTGGGC GCCCCGCTTC
CCCTGCCTCT CCACCCTGGG CGTCCTGGGC ATCGTTTTCA TCGCCATAGC CCTCAAGGCA
CAGGCCATCG CGGCCCGCCC CCAGGATCTG CTGGCAATCC TGCTGCCCCT GGCCATTCTC
TACGGCATCA ACTATAGCCT CAGCACCGTT GTGGGCAGGC TTTTCCTTCC CCGGGGCGAT
GCTATCGCCC TGGCCTACGG CACGGTGATG CGCAACCTCT CCATTGCCCT GGCCGTGGCC
ATGAACGCCT TCGGCAAGGC CGGTTCCGAC GCGGCGCTGG TCATTGCCCT GGCCTACATC
ATCCAGGTGC AGTCAGCCGC CTGGTACGTA AAATTTACCG GCACGCTCTT CGGTGAGGCA
CCAGCGCCGT CCTGTCCCGC ACCGAGGCGG AACTGA
 
Protein sequence
MWQLLSRLNK NLVLTIPAMM AAGFVFGLVT ETGFLKELII PFTFLMVYPM MVTLKLKKVL 
EGGDGKAQLF TQFINFAVVP FVAFGLGRLF FADRPYMALG LLLAALVPTS GMTISWTGFA
RGNLEAAVKM TVVGLILGSI ATPFYVQALM GAHVEVDMTG IMKQIAVIVF LPMAVGYATQ
RYLIAKHGQK GFQEQWAPRF PCLSTLGVLG IVFIAIALKA QAIAARPQDL LAILLPLAIL
YGINYSLSTV VGRLFLPRGD AIALAYGTVM RNLSIALAVA MNAFGKAGSD AALVIALAYI
IQVQSAAWYV KFTGTLFGEA PAPSCPAPRR N
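
The relationship between the gene and protein sequences above can be spot-checked with a small translation sketch. It is a minimal illustration, not the pipeline that produced this record: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits), and it translates only the first and last blocks of the gene sequence. `CODON_TABLE` and `translate` are hypothetical names.

```python
# Codons enumerated in T, C, A, G order, paired with the standard amino acid
# string; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 36 bases of the gene sequence, copied from above.
first_blocks = "ATGTGGCAAC TGCTGAGCAG ACTCAACAAG"
last_blocks = "CCAGCGCCGT CCTGTCCCGC ACCGAGGCGG AACTGA"

print(translate(first_blocks))  # MWQLLSRLNK
print(translate(last_blocks))   # PAPSCPAPRRN
```

The two outputs match the first and the last residues of the protein sequence; the final block is in frame because it begins at base 961 of the gene.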