Gene Information |
Locus tag | EcSMS35_2565 |
Symbol | |
ID | 6143329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2617924 |
End bp | 2618922 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617435 |
Product | bile acid/Na+ symporter family protein |
Protein accession | YP_001744600 |
Protein GI | 170680783 |
COG category | [R] General function prediction only |
COG ID | [COG0385] Predicted Na+-dependent transporter |
TIGRFAM ID | |
|
|
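The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (variable names are illustrative, not part of the record):

```python
# Values copied from the record above.
start_bp = 2617924
end_bp = 2618922

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 999 332
```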
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000897855 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTT TTCGTATCCT CGATCCTTTC ACCTTAACCC TGATCACGGT GGTGCTGCTG GCCTCTTTCT TTCCGGCCAG AGGCGATTCC GTTCCCTTCT TTGAAAATCT GACTACCGCA GCCATTGCTC TGCTGTTCTT TATGCACGGC GCGAAGTTAT CGCGAGAAGC CATTATTGCA GGCGGTGGTC ATTGGCGACT GCATTTATGG GTGATGTGTA GCACCTTCGT GCTGTTCCCA ATTTTGGGCG TGCTGTTTGC CTGGTGGAAA CCGGTAAATG TCGCCCCGAT GCTCTATTCC GGTTTTCTCT ACTTGTGCAT TCTTCCTGCT ACCGTGCAGT CTGCAATCGC CTTCACGTCA ATGGCGGGCG GTAACGTCGC GGCGGCGGTT TGTTCTGCGT CAGCATCCAG CCTGCTGGGG ATTTTCCTTT CACCATTGCT GGTTGGTCTG GTGATGAATG TTCACGGTGC AGGGGGCAGC CTTGAGCAGG TCGGTAAAAT TATGCTGCAA CTGCTGCTGC CGTTTGTGTT GGGGCATCTT TCCCGGCCGT GGATTGGTGA CTGGGTGTCG CGCAATAAAA AATGGATTGC GAAAACTGAC CAGACGTCCA TTCTGTTGGT GGTTTATACG GCATTCAGCG AAGCCGTCGT TAACGGTATC TGGCACAAAG TTGGCTGGGG ATCATTGCTG TTTATCGTGG TGGTCAGTTG CGTTCTTCTG GCTATCGTGA TTGTAGTTAA CGTCTTTATG GCACGCCGAC TGGGCTTCAA TAAGGCAGAT GAAATTACTA TCGTCTTTTG TGGTTCGAAA AAGAGTCTGG CAAATGGCAT CCCGATGGCA AACATTCTGT TCCCCACATC GGTGATCGGT ATGATGGTGC TGCCTCTGAT GATTTTCCAT CAGATCCAAT TGATGGTCTG TGCGGTGCTG GCGCGTCGAT ACAAACGCCA GACCGAACAG TTACAGGCGC AGCAGGAAAG CAGCGCCGAT AAAGCTTAA
|
Protein sequence | MKLFRILDPF TLTLITVVLL ASFFPARGDS VPFFENLTTA AIALLFFMHG AKLSREAIIA GGGHWRLHLW VMCSTFVLFP ILGVLFAWWK PVNVAPMLYS GFLYLCILPA TVQSAIAFTS MAGGNVAAAV CSASASSLLG IFLSPLLVGL VMNVHGAGGS LEQVGKIMLQ LLLPFVLGHL SRPWIGDWVS RNKKWIAKTD QTSILLVVYT AFSEAVVNGI WHKVGWGSLL FIVVVSCVLL AIVIVVNVFM ARRLGFNKAD EITIVFCGSK KSLANGIPMA NILFPTSVIG MMVLPLMIFH QIQLMVCAVL ARRYKRQTEQ LQAQQESSAD KA
|
| |
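The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a hand-rolled illustration, not the database's own pipeline: `translate` is a hypothetical helper, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which are not handled here). It is applied to the first three 10-base blocks of the gene sequence only.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record)
    is removed before translation.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_blocks = "ATGAAACTTT TTCGTATCCT CGATCCTTTC"
print(translate(first_blocks))  # MKLFRILDPF
```

The result matches the first 10-residue block of the protein sequence. Passing the full gene sequence through the same function should, under the same assumptions, reproduce the full protein sequence up to the terminal stop codon.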