Gene Information |
Locus tag | EcSMS35_3183 |
Symbol | shiF |
ID | 6142947 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3265340 |
End bp | 3266533 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641618023 |
Product | transport protein ShiF |
Protein accession | YP_001745173 |
Protein GI | 170683143 |
COG category | |
COG ID | |
TIGRFAM ID | |
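
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the last codon of the CDS is a stop codon. The variable names are illustrative and are not fields of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3265340
end_bp = 3266533

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codon_count, leftover_bases = divmod(gene_length_bp, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover_bases, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1194 bp) and Protein Length (397 aa) fields.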
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCTA AGATCGAAGA TACGCCCCAA AAAACCCTGT CCTGCTGGCC ACTGGCGTTC AGTGCCGGTC TTCTCGGTAT CGGACAGAAC GGTCTGCTGG TTGTACTCCC TGTTCTGGTC ATACAGACAA ATCTGAGTCT GTCTGTATGG GCTGCCCTGC TGATGCTGGG CTCAATGCTG TTTCTGCCAT CTTCCCCATG GTGGGGAAAG CAAATTTCCC TTACTGGCAG TAAGACTGTG GTGCTGTGGG CTCTGGGAGG ATATGGCGTA AGCTTTACCC TGCTTGGGCT GGGAAGCGTG CTGATGGCTA CCGGTGCCGT AACAAAAGCG GTGGGGTTGG GAATATTAAT CATCGCCCGG ATCGTCTACG GTCTGACCGT GTCAGCAATG GTGCCAGCCT GTCAGGTCTG GGCATTGCAG AGAGCGGGAG AAGGGAATCG CATGGCCGCT CTGGCAACCA TCAGCTCCGG CCTGAGCTGC GGCAGGCTAT TCGGGCCGCT GTGCGCGGCG GCAATGTTGG TCATTCACCC TCTGGCGCCA GTGTGGATGC TGATGGCAGC GCCAGCGCTG GCACTGGTGA TGCTTCTGCG GTTGCCCGGC ACACCACCAC AGCCCACACC GGAGCGCAAG AGCGTCAGCC TGAAGCGGGA TTTCCTGCCT TATCTGCTTT GCGCAATGTT ACTGGCTGCG GCAATGAGCA TGATGCAGCT TGGACTTTCG CCAGCCCTTA CTCGCCAGTT CGCCACTGAT ACCACCACTA TTAGCCAACA GGTAGCGTGG TTGTTGGGGC TGTCCGCAAT AGCTGCGCTT ATCGCGCAGT TCGTGGTACT CCGTCCACAG CGCCTGACTC CAGTGGCTCT GCTCCTGAGT GCCGGGGTGT TGATGAGTAG TGGTCTGGCT ATCATGCTCG CTGAACAGCT ATGGTTGTTT TACCTAGGCT GTGCAGTGCT GTCATTTGGA GCTGCTCTGG CAACCCCCGC TTATCAACTT TTACTGAATG ATAAGCTGGC CGACGGCGCA GGCGCGGGCT GGCTCGCTTG CAGTCACACA CTTGGCTATG GGCTTTGCGC GTTGTTGGTA CCATTGGTGT CGAAAACAGG TGTCGCAATA GCACTGATTG TGATGGCATT ATTTGCCGCT GTATTATTTA GCATGGTGAC TGTATTTATC TGGCGCTGCT GCAAAAGCAA GTAA
|
Protein sequence | MSSKIEDTPQ KTLSCWPLAF SAGLLGIGQN GLLVVLPVLV IQTNLSLSVW AALLMLGSML FLPSSPWWGK QISLTGSKTV VLWALGGYGV SFTLLGLGSV LMATGAVTKA VGLGILIIAR IVYGLTVSAM VPACQVWALQ RAGEGNRMAA LATISSGLSC GRLFGPLCAA AMLVIHPLAP VWMLMAAPAL ALVMLLRLPG TPPQPTPERK SVSLKRDFLP YLLCAMLLAA AMSMMQLGLS PALTRQFATD TTTISQQVAW LLGLSAIAAL IAQFVVLRPQ RLTPVALLLS AGVLMSSGLA IMLAEQLWLF YLGCAVLSFG AALATPAYQL LLNDKLADGA GAGWLACSHT LGYGLCALLV PLVSKTGVAI ALIVMALFAA VLFSMVTVFI WRCCKSK
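
The relationship between the two sequences can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: the helper names `ungroup` and `translate` are hypothetical, and the codon table is the standard set of assignments, which translation table 11 shares for elongation (table 11 differs in its permitted start codons, which do not matter here because the gene starts with ATG). Only the first 30 and last 24 bases of the gene sequence are used as input.

```python
# Codon assignments in NCBI order (first, second, third base over T, C, A, G).
# Assumption: table 11 decodes internal codons the same way as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def ungroup(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character groups."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts copied from the gene sequence above: the first 30 bases
# and the last 24 bases (which end in the stop codon TAA).
gene_head = ungroup("ATGTCATCTA AGATCGAAGA TACGCCCCAA")
gene_tail = ungroup("TGGCGCTGCT GCAAAAGCAA GTAA")

print(translate(gene_head))  # first ten residues
print(translate(gene_tail))  # last seven residues
```

The two excerpts translate to MSSKIEDTPQ and WRCCKSK, which match the first and last residues of the protein sequence listed above.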