Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU1411 |
Symbol | |
ID | 2685976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 1549047 |
End bp | 1550042 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637126085 |
Product | sodium/bile acid symporter family protein |
Protein accession | NP_952463 |
Protein GI | 39996512 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.618786 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGCAAC TGCTGAGCAG ACTCAACAAG AACCTGGTGC TGACCATTCC CGCCATGATG GCCGCGGGCT TCGTCTTCGG TCTCGTGACT GAGACCGGCT TTCTCAAGGA GCTGATCATC CCCTTCACCT TTCTCATGGT CTACCCCATG ATGGTGACCC TTAAGCTGAA GAAGGTACTG GAGGGAGGTG ACGGCAAGGC CCAGCTGTTC ACCCAGTTCA TCAACTTCGC GGTGGTGCCC TTTGTCGCCT TCGGGCTGGG AAGGCTTTTC TTTGCTGACC GCCCCTACAT GGCCCTGGGC CTGCTGCTTG CGGCCCTGGT CCCCACCAGC GGCATGACCA TCTCCTGGAC CGGCTTTGCC AGGGGGAACC TCGAAGCGGC AGTCAAGATG ACCGTCGTGG GCCTCATCCT GGGCTCGATC GCAACCCCCT TCTACGTGCA AGCCCTCATG GGCGCCCATG TGGAGGTGGA TATGACCGGC ATCATGAAGC AAATCGCCGT CATCGTTTTC CTGCCCATGG CGGTCGGGTA TGCGACCCAG CGCTATCTGA TAGCGAAGCA CGGCCAGAAA GGATTCCAGG AACAGTGGGC GCCCCGCTTC CCCTGCCTCT CCACCCTGGG CGTCCTGGGC ATCGTTTTCA TCGCCATAGC CCTCAAGGCA CAGGCCATCG CGGCCCGCCC CCAGGATCTG CTGGCAATCC TGCTGCCCCT GGCCATTCTC TACGGCATCA ACTATAGCCT CAGCACCGTT GTGGGCAGGC TTTTCCTTCC CCGGGGCGAT GCTATCGCCC TGGCCTACGG CACGGTGATG CGCAACCTCT CCATTGCCCT GGCCGTGGCC ATGAACGCCT TCGGCAAGGC CGGTTCCGAC GCGGCGCTGG TCATTGCCCT GGCCTACATC ATCCAGGTGC AGTCAGCCGC CTGGTACGTA AAATTTACCG GCACGCTCTT CGGTGAGGCA CCAGCGCCGT CCTGTCCCGC ACCGAGGCGG AACTGA
|
Protein sequence | MWQLLSRLNK NLVLTIPAMM AAGFVFGLVT ETGFLKELII PFTFLMVYPM MVTLKLKKVL EGGDGKAQLF TQFINFAVVP FVAFGLGRLF FADRPYMALG LLLAALVPTS GMTISWTGFA RGNLEAAVKM TVVGLILGSI ATPFYVQALM GAHVEVDMTG IMKQIAVIVF LPMAVGYATQ RYLIAKHGQK GFQEQWAPRF PCLSTLGVLG IVFIAIALKA QAIAARPQDL LAILLPLAIL YGINYSLSTV VGRLFLPRGD AIALAYGTVM RNLSIALAVA MNAFGKAGSD AALVIALAYI IQVQSAAWYV KFTGTLFGEA PAPSCPAPRR N
|
| |