Gene EcSMS35_2216 details


Gene Information

Locus tag: EcSMS35_2216
Symbol:
ID: 6146396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2232062
End bp: 2233822
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 50%
IMG OID: 641617092
Product: hypothetical protein
Protein accession: YP_001744266
Protein GI: 170683951
COG category: [S] Function unknown
COG ID: [COG1944] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00702] uncharacterized domain
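
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both assumptions agree with the values listed, but neither is stated by the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2232062
END_BP = 2233822


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 1761 586
```

Under these assumptions the computed values match the listed Gene Length (1761 bp) and Protein Length (586 aa).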


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.522723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.321049
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAA CATTTATCCC CGGCAAAGAT GCCGCACTGG AAGATTCCAT CGCTCGCTTC 
CAGCAAAAAC TTTCAGACCT CGGCTTTCAG ATTGAAGAGG CCTCCTGGCT GAATCCCGTG
CCTAACGTCT GGTCTGTACA TATTCGCGAC AAAGAGTGCG CACTGTGTTT TACCAACGGT
AAAGGCGCAA CCAAAAAAGC GGCGCTGGCT TCTGCACTCG GTGAATATTT CGAGCGTCTC
TCAACCAACT ACTTTTTTGC GGACTTCTGG CTGGGCGAAA CCATCGCCAA CGGCCCATTC
GTGCATTATC CCAACGAAAA ATGGTTCCCA CTGACCGAAA ATGACGATGT ACCAGAAGGG
CTGCTCGATG ACCGTCTGCG CGCATTTTAC GATCCGGAGA ATGAACTGAC CGGTAGTATG
CTGATTGACC TGCAATCCGG TAACGAAGAT CGTGGTATTT GTGGTCTACC GTTTACGCGC
CAGTCCGACA ACCAGACCAT TTATATTCCG ATGAATATCA TTGGTAACTT GTACGTTTCT
AACGGTATGT CTGCTGGCAA TACCCGTAAC GAAGCACGCG TTCAGGGATT GTCCGAAGTT
TTCGAACGCT ACGTGAAAAA CCGCATTATT GCTGAAAGCA TCAGCTTGCC GGAAATCCCG
GCAGACGTGC TGGCGCGTTA CCCAGCGGTG GTTGAAGCGA TCGAAACACT GGAAGCAGAA
GGTTTCCCAA TCTTTGCTTA TGATGGTTCG CTTGGCGGCC AGTATCCGGT GATTTGCGTG
GTTCTGTTTA ATCCTGCTAA CGGCACCTGC TTTGCCTCTT TCGGTGCGCA TCCTGATTTT
GGCGTAGCAC TGGAACGTAC CGTGACCGAG CTGCTGCAAG GTCGTGGCCT GAAAGATTTG
GATGTGTTTA CTCCGCCAAC CTTCGATGAT GAAGAAGTTG CTGAACATAC CAACCTCGAA
ACGCACTTTA TCGATTCCAG CGGTTTAATC TCCTGGGACC TGTTCAAGCA GGATGCCGAT
TATCCGTTTG TGGACTGGAA TTTCTCCGGC ACTACGGAAG AAGAGTTTGC TACGCTGATG
GCTATCTTCA ACAAAGAAGA TAAAGAAGTT TATATTGCCG ATTACGAGCA TCTGGGCGTT
TATGCTTGCC GTATTATCGT GCCTGGCATG TCCGATATTT ATCCGGCTGA AGATCTGTGG
TTAGCGAATA ACAGTATGGG TAGCCATTTA CGTGAAACCA TTCTCTCGCT ACCAGGCAGT
GTGTGGGAAA AAGAAGATTA CCTGAACCTC ATCGAGCAAC TGGATGAAGA AGGTTTTGAT
GACTTTACTC GCGTACGTGA ACTACTGGGT CTGGCGACCG GACCGGATAA CGGTTGGTAC
ACCCTGCGTA TCGGTGAATT AAAAGCCATG CTGGCGCTGG CTGGTGGCGA TCTGGAACAG
GCTCTGGTCT GGACTGAATG GACGATGGAA TTTAACTCAT CAGTGTTCAG TCCGGAACGC
GCCAACTATT ATCGCTGTCT GCAAACGTTG TTATTACTGG CACAGGAAGA AGACCGCCAA
CCGCTGCAAT ATCTGAATGC CTTCGTTCGT ATGTATGGCG CAGACGCCGT GGAAGCCGCC
AGTGCGGCAA TGAGCGGCGA AGCGGCGTTT TATGGCTTGC AACCAGTAGA TAGCGATCTG
CACGCGTTTG CTGCACATCA ATCGTTGCTG AAAGCCTACG AAAAGCTGCA GCGCGCCAAA
GCGGCATTCT GGGCAAAATA A
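
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. A minimal sketch of that cleanup, together with a GC-fraction calculation comparable to the GC content field of this record, might look as follows. The helper names are hypothetical, and only the first row of the sequence is used here for brevity; pasting the full sequence in its place would give the value for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence above, copied verbatim.
first_row = "ATGACGCAAA CATTTATCCC CGGCAAAGAT GCCGCACTGG AAGATTCCAT CGCTCGCTTC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```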
 
Protein sequence
MTQTFIPGKD AALEDSIARF QQKLSDLGFQ IEEASWLNPV PNVWSVHIRD KECALCFTNG 
KGATKKAALA SALGEYFERL STNYFFADFW LGETIANGPF VHYPNEKWFP LTENDDVPEG
LLDDRLRAFY DPENELTGSM LIDLQSGNED RGICGLPFTR QSDNQTIYIP MNIIGNLYVS
NGMSAGNTRN EARVQGLSEV FERYVKNRII AESISLPEIP ADVLARYPAV VEAIETLEAE
GFPIFAYDGS LGGQYPVICV VLFNPANGTC FASFGAHPDF GVALERTVTE LLQGRGLKDL
DVFTPPTFDD EEVAEHTNLE THFIDSSGLI SWDLFKQDAD YPFVDWNFSG TTEEEFATLM
AIFNKEDKEV YIADYEHLGV YACRIIVPGM SDIYPAEDLW LANNSMGSHL RETILSLPGS
VWEKEDYLNL IEQLDEEGFD DFTRVRELLG LATGPDNGWY TLRIGELKAM LALAGGDLEQ
ALVWTEWTME FNSSVFSPER ANYYRCLQTL LLLAQEEDRQ PLQYLNAFVR MYGADAVEAA
SAAMSGEAAF YGLQPVDSDL HAFAAHQSLL KAYEKLQRAK AAFWAK
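
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact TCAG-ordered string used in NCBI's genetic code listings; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which this sketch does not model. The function and constant names are illustrative, and translation simply stops at the first stop codon.

```python
from itertools import product

# Codons in T, C, A, G order (third base varying fastest) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence in this record, copied verbatim.
print(translate("ATGACGCAAA CATTTATCCC CGGCAAAGAT GCCGCACTGG AAGATTCCAT CGCTCGCTTC"))
print(translate("GCGGCATTCT GGGCAAAATA A"))
```

The two translated fragments correspond to the first twenty residues (MTQTFIPGKD AALEDSIARF) and the last six residues (AAFWAK) of the protein sequence listed above.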