Gene EcSMS35_4291 details


Gene Information

Locus tag: EcSMS35_4291
Symbol: rafY2
ID: 6146426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4394485
End bp: 4395879
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 45%
IMG OID: 641619112
Product: glycoporin RafY
Protein accession: YP_001746236
Protein GI: 170681010
COG category:
COG ID:
TIGRFAM ID:
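
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The variable and function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4394485
end_bp = 4395879
gene_length_bp = 1395
protein_length_aa = 464

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon, which is not counted as a residue.
codons = span // 3
residues = codons - 1


def record_is_consistent():
    """Return True if the coordinates, gene length and protein length agree."""
    return span == gene_length_bp and span % 3 == 0 and residues == protein_length_aa


print(span, codons, residues, record_is_consistent())
# prints: 1395 465 464 True
```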


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.216775
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAT CGACATTATC TTTAGCCATC GGTTTATTAT TGGCATGTAG TACCGGAATG 
GCAAAAACAC AGCATTTAAC GCTGGAACAA CGCATGGCAT TGCTGGAAGA ACGCCTGGAA
GCGGCAGAAA TGCGGGCAGC AAAAGCAGAG AGCCAGGTTA AACAGCTGCA GACACAACAA
GCCGCTGAGA TCCGCGAAAT TAAGGCTGCC CAGGGCAATA CGCCGGTAAA TGGACAGGCA
ACGGCGGAAT CTGCAAAGAA AAACTCCACC TCACCTAATC TTTTGCTCTC AGGTTATGGC
GATTTAAAAA TCTACGGCGA CGTAGAATTT AATATGGATG CAGAAAGTAA TCATGGCCTG
CTGGCAATGA CCAACGCTGA TGTGAATAGC GATCCCACTA ATGAACAGTG GAATCTCAAT
GGTCGTATTT TGTTAGGTTT TGATGGTATG CGAAAACTGG ATAATGGCTA TTTCGCCGGG
TTCTCCGCAC AACCGCTGGG GGACATGCAC GGTTCAGTAA ATATCGATGA TGCGGTATTC
TTCTTTGGGA AAGAGAATGA CTGGAAGGTC AAAGTCGGCC GTTTTGAAGC CTACGATATG
TTCCCGCTGA ATCAGGATAC CTTTGTTGAA CATTCCGGTA ATACTGCGAA CGATCTTTAT
GACGATGGCA GCGGTTATAT CTATATGATG AAAGAGGGCC GCGGGCGCTC TAACGCTGGC
GGTAATTTCC TCGTCAGCAA ACAACTTGAT AACTGGTATT TTGAGTTAAA CACGTTACTT
GAAGACGGAA CATCTTTATA TAATGACGGT AATTATCATG GACGCGATAT GGAGCAGCAG
AAAAATGTTG CTTATCTGCG TCCGGTAATT GCCTGGTCGC CGACGGAAGA ATTCACCGTT
TCCGCAGCGA TGGAAGCGAA CGTAGTAAAT AATGCTTATG GTTATACCGA TAGCAAGGGT
AATTTTGTCG ATCAGTCCGA TCGTACCGGC TATGGTATGA GCATGACCTG GAATGGCCTG
AAAACGGATC CGGAAAATGG CGTCGTGGTT AATCTTAATA CCGCCTATTT AGATGCTAAT
AATGAGAAAG ATTTCACTGC CGGGATTAAC GCGCTGTGGA AACGTTTCGA GCTGGGTTAT
ATCTACGCGC ACAATAAGAT TGATGAATTC AGCGGTGTAG TTTGTGATAA CGACTGCTGG
ATTGATGATG AAGGGACGTA CACCATTCAC ACCATTCATG CGTCTTATCA GTTCGCTAAT
GTGATGGATA TGGAGAACTT TAATATTTAC CTCGGGACGT ATTACTCCAT TCTGGATAGC
GACGGTGATA AAAAACACGG TGATGATACT GATGACCGTT ACGGCGCACG CGTTCGCTTT
AAATACTTCT TCTGA
 
Protein sequence
MKKSTLSLAI GLLLACSTGM AKTQHLTLEQ RMALLEERLE AAEMRAAKAE SQVKQLQTQQ 
AAEIREIKAA QGNTPVNGQA TAESAKKNST SPNLLLSGYG DLKIYGDVEF NMDAESNHGL
LAMTNADVNS DPTNEQWNLN GRILLGFDGM RKLDNGYFAG FSAQPLGDMH GSVNIDDAVF
FFGKENDWKV KVGRFEAYDM FPLNQDTFVE HSGNTANDLY DDGSGYIYMM KEGRGRSNAG
GNFLVSKQLD NWYFELNTLL EDGTSLYNDG NYHGRDMEQQ KNVAYLRPVI AWSPTEEFTV
SAAMEANVVN NAYGYTDSKG NFVDQSDRTG YGMSMTWNGL KTDPENGVVV NLNTAYLDAN
NEKDFTAGIN ALWKRFELGY IYAHNKIDEF SGVVCDNDCW IDDEGTYTIH TIHASYQFAN
VMDMENFNIY LGTYYSILDS DGDKKHGDDT DDRYGARVRF KYFF