Gene Information |
Locus tag | EcSMS35_0631 |
Symbol | |
ID | 6143864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 644517 |
End bp | 645947 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615523 |
Product | sodium:sulfate symporter family protein |
Protein accession | YP_001742729 |
Protein GI | 170680678 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | [TIGR00785] anion transporter |
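The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the protein length excludes the stop codon; the function names are illustrative only, not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 644517
end_bp = 645947
gene_length_bp = 1431
protein_length_aa = 476


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding the final stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # compare with "Gene Length"
print(coding_codons(gene_length_bp))   # compare with "Protein Length"
```

Under these assumptions both printed values agree with the table: 1431 bp for the span, and 476 residues once the stop codon is subtracted.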
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCGCCCC TGGTGGTGAT GGGTGTCATG TTTCTTATCC CTGTACCTGA CGGTATGCCG CCGCAGGCGT GGCATTACTT TGCAGTGTTT GTGGCAATGA TTGTCGGCAT GATCCTCGAG CCAATTCCGG CAACGGCGAT CAGTTTTATT GCGGTTACTA TTTGCGTTAT TGGCAGTAAT TACCTGCTCT TTGATGCCAA AGAATTAGCT GACCCAGCGT TTAATGCGCA AAAACAGGCG CTGAAATGGG GGCTGGCAGG TTTCTCCAGC ACCACCGTCT GGCTGGTGTT TGGCGCATTT ATTTTTGCGC TGGGCTATGA AGTCACCGGT CTGGGCCGTC GTATCGCCCT TTTCCTGGTG AAATTCATGG GTAAACGCAC GCTGACGCTG GGTTACGCGA TTGTCATTAT CGACATTCTG CTGGCACCGT TTACACCGTC CAACACCGCG CGTACCGGGG GTACGGTTTT TCCGGTCATT AAAAACCTGC CGCCGCTGTT TAAATCATTC CCGAACGATC CGTCCGCGCG TCGTATTGGC GGCTATTTGA TGTGGATGAT GGTCATTAGT ACCAGTCTGA GTTCGTCCAT GTTTGTCACC GGTGCGGCAC CAAACGTGCT GGGTCTGGAG TTCGTCAGCA AAATCGCAGG GATCCAGATT AGCTGGCTGC AATGGTTCCT GAGCTTCCTG CCGGTCGGTA TTATTTTGCT GATCGTTGCT CCGTGGCTCT CCTATGTGCT GTACAAGCCG GAAGTGACTC ACAGTGCCGA AGTGGCAGCA TGGGCCGGTG ATGAACTGAA AACGATGGGT GCATTGAGCC GTAAAGAGTG GACCCTGATA GGTCTGGTGC TGCTGAGCTT AGGCTTATGG GTATTCGGCG GCGAAATGAT CGACGCCACG GCGGTAGGTC TGCTGGCGGT TTCGCTGATG CTGGCCCTGC ACGTTGTACC GTGGAAAGAC ATTACCCGCT ACAACAGCGC CTGGAACACA CTGGTCAACC TGGCAACGCT GGTTGTTATG GCGAACGGTT TGACCCGCTC TGGTTTTATC GACTGGTTCG CTAGCACCAT GAGCACGCAC CTGGAAGGCT TCTCACCGAA CGCAACGGTA ATTGTACTGG TTCTGGTGTT CTACTTTGCA CACTACCTGT TTGCCAGCCT GTCTGCGCAC ACCGCAACCA TGCTGCCGGT TATTCTGGCC GTCGGTAAAG GTATTCCGGG CGTACCAATG GAACAACTGT GTATCCTGCT GGTGCTGTCT ATCGGTATCA TGGGCTGTCT GACGCCGTAT GCAACCGGTC CTGGGGTGAT TATTTACGGC TGTGGCTATG TGAAATCAAA AGATTACTGG CGTCTTGGCG CAATCTTCGG GGTGATTTAC ATCTCTATGC TGCTGTTGGT TGGCTGGCCG ATTCTCGCCA TGTGGAACTA A
Protein sequence | MAPLVVMGVM FLIPVPDGMP PQAWHYFAVF VAMIVGMILE PIPATAISFI AVTICVIGSN YLLFDAKELA DPAFNAQKQA LKWGLAGFSS TTVWLVFGAF IFALGYEVTG LGRRIALFLV KFMGKRTLTL GYAIVIIDIL LAPFTPSNTA RTGGTVFPVI KNLPPLFKSF PNDPSARRIG GYLMWMMVIS TSLSSSMFVT GAAPNVLGLE FVSKIAGIQI SWLQWFLSFL PVGIILLIVA PWLSYVLYKP EVTHSAEVAA WAGDELKTMG ALSRKEWTLI GLVLLSLGLW VFGGEMIDAT AVGLLAVSLM LALHVVPWKD ITRYNSAWNT LVNLATLVVM ANGLTRSGFI DWFASTMSTH LEGFSPNATV IVLVLVFYFA HYLFASLSAH TATMLPVILA VGKGIPGVPM EQLCILLVLS IGIMGCLTPY ATGPGVIIYG CGYVKSKDYW RLGAIFGVIY ISMLLLVGWP ILAMWN
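The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it assumes translation table 11 assigns the same amino acids to codons as the standard code, and that the first codon (here TTG, an alternative start codon) is read as methionine because it acts as the initiator. The names `clean` and `translate_cds` are hypothetical helpers.

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order; table 11 is assumed
# to share these assignments and to differ only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces used in the record and uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a valid initiator and is read as 'M'.
    """
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[dna[index:index + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate_cds("TTGGCGCCCC TGGTGGTGAT GGGTGTCATG"))  # MAPLVVMGVM
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate_cds` should allow the whole protein to be compared the same way.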