Gene EcSMS35_3117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3117 
SymbolrafY1 
ID6147373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3204942 
End bp3206336 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content39% 
IMG OID641617984 
Productglycoporin RafY 
Protein accessionYP_001745134 
Protein GI170680709 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.303437 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGT TACTTTTGGG TTTATCTCTT TTTTTATTAC TGGAATACAA TGCGTGTGCA 
GCAAAAACAG GGGGGCTAAC GCTTGAACAA AGAATGGCAT TGCTGGAAGA ACGCCTGGAA
GCTGCAGAAA AAAGGGCGGA AAAAGCAGAA AGAATGTTAA AGAGTTTTGA TATAGAACAA
CATTCTGAAA TACGTCAAAT TAGTTCAGAG CAGAATAAAA AAGATGCTAA CAGCTATGCA
GTAGTTGAGT CAACAAAAGA AAAAACATCT TCTCCAGGTT TTCTGCTCTC TGGTTACAAT
GATTTGAAGT TTTATGGTGA TGTGGAGTTT AATATAGATG CTGCCAGTAA ACCCGGTCAG
TTAGTAATGA TAAGTTCCGG GGCGAACAGT GAGTCAGTGA ATGAACGGTG GGACCTTAAT
GGCCGTATTC TATTAGGTTT TGACGGTACC CGTAAGCTTG ATAATGGTTA TTTCGCTGGA
TTTTCAGCAC AACCACTGGC GGATATGCAC GGTTCAGTGA ATATTGATGA TGCATTATTC
TTTTTTGGAA AAGATGACGA GTGGAAAGTG AAAGTCGGTC GTTTTGAAGC TTACGATATG
TTCCCTCTAA ATCAGGATAC TTTCATTGAG TATTCCGGTA ATACAGCTAA TGATATTTAT
GCTGATGGCC GTGGTTATAT CTATATGATG AAAGAGGGAC GCGGTCGTTC TGACGCTGGT
GGTAATTTCC TCATCAGTAA ACAACTCGAT AACTGGTATT TTGAGTTAAA CACGTTACTT
GAAGACGGAA CATCTTTATA TAATAACGGT AATTATCATG GACGATATAT GGAGCAGCAG
AAAAATGTCG CTTATCTGCG CCCGGTAATT GCCTGGTCGC CAACGGAAGA ATTCACAGTC
TCCGCAGCGA TGGAAGCGAA CGTGGTAAAA AATGCTTATG GCTATACCGA TAATAAGGGG
AATTTTGTCG ATCAGTCCAA TCGTTCCGGT TATGGTATTA GCATGACATG GAACGGTCTG
AAAACAGATC CGGAAAATGG CATCGTGGTT ACTCTTAATA CCGCCTATTT GGATGCCAGT
AATGAGAAAG ATTTCACTGC CAGTATTAAT GCTCTGTGGA AACGTTTCGA ACTGGGTTAT
ATCTACGCAC ATAATAAGAT TGATGAATTT AGTGGTGTCG TGTGTAATAA TGGTTGCTGG
ATTGATGGTG AAGGAATATA CAACATTCAC ACCATTCATG CATCTTATCA GTTTGCTAAT
GTGATGGATA TGGAGAACTT TAATATTTAC CTCGGGGCTT ATTACTCAAT TCTGGATAGT
AACTGTAGAT ATAGTAATTG TGACATTACC GATGATCGTT ACGGTGCCCG ATTACGTTTC
AAATACTTTT TTTGA
 
Protein sequence
MNKLLLGLSL FLLLEYNACA AKTGGLTLEQ RMALLEERLE AAEKRAEKAE RMLKSFDIEQ 
HSEIRQISSE QNKKDANSYA VVESTKEKTS SPGFLLSGYN DLKFYGDVEF NIDAASKPGQ
LVMISSGANS ESVNERWDLN GRILLGFDGT RKLDNGYFAG FSAQPLADMH GSVNIDDALF
FFGKDDEWKV KVGRFEAYDM FPLNQDTFIE YSGNTANDIY ADGRGYIYMM KEGRGRSDAG
GNFLISKQLD NWYFELNTLL EDGTSLYNNG NYHGRYMEQQ KNVAYLRPVI AWSPTEEFTV
SAAMEANVVK NAYGYTDNKG NFVDQSNRSG YGISMTWNGL KTDPENGIVV TLNTAYLDAS
NEKDFTASIN ALWKRFELGY IYAHNKIDEF SGVVCNNGCW IDGEGIYNIH TIHASYQFAN
VMDMENFNIY LGAYYSILDS NCRYSNCDIT DDRYGARLRF KYFF