Gene Information |
Locus tag | EcSMS35_3646 |
Symbol | tsgA |
ID | 6146691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3704697 |
End bp | 3705878 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618473 |
Product | hypothetical protein |
Protein accession | YP_001745613 |
Protein GI | 170682755 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0738] Fucose permease |
TIGRFAM ID | |
|
|
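The length fields in this record follow from its coordinates. The sketch below reproduces that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3704697
END_BP = 3705878


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1182
print(protein_length(length_bp))  # 393
```

Both results match the Gene Length (1182 bp) and Protein Length (393 aa) fields.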
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.865659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.000149858 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTAACA GCAATCGCAT CAAGCTCACA TGGATTAGCT TTCTCTCCTA CGCACTGACC GGTGCGTTGG TTATTGTCAC CGGGATGGTG ATGGGAAATA TCGCCGATTA TTTCAATCTG CCTGTTTCCA GTATGAGTAA TACCTTCACC TTCCTCAACG CCGGCATTTT AATCTCTATC TTCCTCAACG CCTGGCTGAT GGAAATCGTC CCGTTGAAAA CGCAGTTACG TTTTGGCTTT CTCCTGATGG TGCTGGCGGT TGCCGGTTTG ATGTTCAGCC ACAGCCTGGC ACTGTTCTCG GCGGCGATGT TCATTCTCGG GGTGGTCAGC GGCATCACCA TGTCGATTGG TACATTCCTG GTAACACAAA TGTATGAAGG ACGTCAGCGC GGTTCACGCC TGTTATTTAC CGACTCCTTC TTCAGTATGG CCGGGATGAT TTTCCCAATG ATCGCCGCGT TTCTGCTGGC GCGCAGCATT GAGTGGTACT GGGTTTATGC CTGCATCGGG CTGGTGTACG TCGCTATCTT TATTCTGACC TTCGGCTGTG AGTTCCCGGC GCTGGGTAAA CATGCGCCAA AAACGGACGC TCCGGTAGCG AAAGAAAAAT GGGGGATCGG CGTGCTGTTT CTCTCCATTG CGGCACTGTG CTACATCCTC GGTCAGTTAG GTTTTATCTC CTGGGTGCCT GAGTATGCCA AAGGCCTGGG CATGAGCCTG AACGACGCGG GCACGCTGGT GAGTAACTTC TGGATGTCAT ACATGGTCGG CATGTGGGCG TTCAGCTTTA TTCTTCGCTT CTTTGATTTG CAACGCATTC TGACCGTACT GGCTGGTCTG GCTGCGATTC TGATGTACGT CTTTAACACC GGAACACCGG CACATATGGC GTGGTCAATT CTCGCCCTGG GCTTCTTCTC CAGCGCGATC TATACCACCA TCATCACTTT GGGTTCACAG CAGACCAAAG TACCGTCGCC AAAACTGGTT AACTTTGTCC TGACCTGCGG GACCATCGGT ACTATGTTGA CCTTTGTGGT TACCGGCCCG ATTGTTGAAC ATAGCGGTCC GCAGGCGGCG CTGCTGACGG CAAACGGTCT GTACGCTGTC GTCTTTGTGA TGTGCTTCCT GTTAGGTTTC GTCAGCCGTC ACCGTCAGCA TAACACCCTG ACCTCTCATT AA
|
Protein sequence | MTNSNRIKLT WISFLSYALT GALVIVTGMV MGNIADYFNL PVSSMSNTFT FLNAGILISI FLNAWLMEIV PLKTQLRFGF LLMVLAVAGL MFSHSLALFS AAMFILGVVS GITMSIGTFL VTQMYEGRQR GSRLLFTDSF FSMAGMIFPM IAAFLLARSI EWYWVYACIG LVYVAIFILT FGCEFPALGK HAPKTDAPVA KEKWGIGVLF LSIAALCYIL GQLGFISWVP EYAKGLGMSL NDAGTLVSNF WMSYMVGMWA FSFILRFFDL QRILTVLAGL AAILMYVFNT GTPAHMAWSI LALGFFSSAI YTTIITLGSQ QTKVPSPKLV NFVLTCGTIG TMLTFVVTGP IVEHSGPQAA LLTANGLYAV VFVMCFLLGF VSRHRQHNTL TSH
|
| |
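The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a minimal sketch, not the method used to produce the record. It uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGACTAACA GCAATCGCAT CAAGCTCACA"))  # MTNSNRIKLT
# Last blocks of the gene sequence, ending in the TAA stop codon.
print(translate("ACCTCTCATT AA"))                     # TSH
```

Passing the full Gene sequence field to `translate` should give a string to compare against the Protein sequence field, and `gc_percent` on the same input gives a value to compare against the GC content field.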