Gene EcSMS35_3183 details


Gene Information

Locus tag: EcSMS35_3183
Symbol: shiF
ID: 6142947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3265340
End bp: 3266533
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 56%
IMG OID: 641618023
Product: transport protein ShiF
Protein accession: YP_001745173
Protein GI: 170683143
COG category:
COG ID:
TIGRFAM ID:
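
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates and that the final codon is a stop codon contributing no residue. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3265340, 3266533)
    print(length, protein_length(length))
```

Under these assumptions the reported gene length (1194 bp) and protein length (397 aa) agree with the reported coordinates.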


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATCTA AGATCGAAGA TACGCCCCAA AAAACCCTGT CCTGCTGGCC ACTGGCGTTC 
AGTGCCGGTC TTCTCGGTAT CGGACAGAAC GGTCTGCTGG TTGTACTCCC TGTTCTGGTC
ATACAGACAA ATCTGAGTCT GTCTGTATGG GCTGCCCTGC TGATGCTGGG CTCAATGCTG
TTTCTGCCAT CTTCCCCATG GTGGGGAAAG CAAATTTCCC TTACTGGCAG TAAGACTGTG
GTGCTGTGGG CTCTGGGAGG ATATGGCGTA AGCTTTACCC TGCTTGGGCT GGGAAGCGTG
CTGATGGCTA CCGGTGCCGT AACAAAAGCG GTGGGGTTGG GAATATTAAT CATCGCCCGG
ATCGTCTACG GTCTGACCGT GTCAGCAATG GTGCCAGCCT GTCAGGTCTG GGCATTGCAG
AGAGCGGGAG AAGGGAATCG CATGGCCGCT CTGGCAACCA TCAGCTCCGG CCTGAGCTGC
GGCAGGCTAT TCGGGCCGCT GTGCGCGGCG GCAATGTTGG TCATTCACCC TCTGGCGCCA
GTGTGGATGC TGATGGCAGC GCCAGCGCTG GCACTGGTGA TGCTTCTGCG GTTGCCCGGC
ACACCACCAC AGCCCACACC GGAGCGCAAG AGCGTCAGCC TGAAGCGGGA TTTCCTGCCT
TATCTGCTTT GCGCAATGTT ACTGGCTGCG GCAATGAGCA TGATGCAGCT TGGACTTTCG
CCAGCCCTTA CTCGCCAGTT CGCCACTGAT ACCACCACTA TTAGCCAACA GGTAGCGTGG
TTGTTGGGGC TGTCCGCAAT AGCTGCGCTT ATCGCGCAGT TCGTGGTACT CCGTCCACAG
CGCCTGACTC CAGTGGCTCT GCTCCTGAGT GCCGGGGTGT TGATGAGTAG TGGTCTGGCT
ATCATGCTCG CTGAACAGCT ATGGTTGTTT TACCTAGGCT GTGCAGTGCT GTCATTTGGA
GCTGCTCTGG CAACCCCCGC TTATCAACTT TTACTGAATG ATAAGCTGGC CGACGGCGCA
GGCGCGGGCT GGCTCGCTTG CAGTCACACA CTTGGCTATG GGCTTTGCGC GTTGTTGGTA
CCATTGGTGT CGAAAACAGG TGTCGCAATA GCACTGATTG TGATGGCATT ATTTGCCGCT
GTATTATTTA GCATGGTGAC TGTATTTATC TGGCGCTGCT GCAAAAGCAA GTAA
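
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is the percentage of G and C bases over all bases and that the spaces between the 10-base blocks are only formatting. `gc_percent` is a hypothetical helper, and the rounding the database applies is not stated in this record.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above, used only as sample input.
    first_line = "ATGTCATCTA AGATCGAAGA TACGCCCCAA AAAACCCTGT CCTGCTGGCC ACTGGCGTTC"
    print(round(gc_percent(first_line), 1))
```

Applied to the full 1194 bp sequence, the result can be compared with the reported value.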
 
Protein sequence
MSSKIEDTPQ KTLSCWPLAF SAGLLGIGQN GLLVVLPVLV IQTNLSLSVW AALLMLGSML 
FLPSSPWWGK QISLTGSKTV VLWALGGYGV SFTLLGLGSV LMATGAVTKA VGLGILIIAR
IVYGLTVSAM VPACQVWALQ RAGEGNRMAA LATISSGLSC GRLFGPLCAA AMLVIHPLAP
VWMLMAAPAL ALVMLLRLPG TPPQPTPERK SVSLKRDFLP YLLCAMLLAA AMSMMQLGLS
PALTRQFATD TTTISQQVAW LLGLSAIAAL IAQFVVLRPQ RLTPVALLLS AGVLMSSGLA
IMLAEQLWLF YLGCAVLSFG AALATPAYQL LLNDKLADGA GAGWLACSHT LGYGLCALLV
PLVSKTGVAI ALIVMALFAA VLFSMVTVFI WRCCKSK
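
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal sketch, assuming the sequence is read in frame from its first base and relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G), table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGTCATCTA AGATCGAAGA TACGCCCCAA"))  # MSSKIEDTPQ
```

The first ten residues produced this way match the start of the listed protein sequence, and the final codons of the gene translate to the listed C-terminus (WRCCKSK) followed by a TAA stop.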