Gene EcSMS35_3664 details

Gene Information

Locus tag: EcSMS35_3664
Symbol: damX
ID: 6143251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3722809
End bp: 3724095
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 55%
IMG OID: 641618491
Product: hypothetical protein
Protein accession: YP_001745631
Protein GI: 170683150
COG category: [S] Function unknown
COG ID: [COG3266] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
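
The Start bp, End bp, Gene Length and Protein Length values in this record are consistent with one another, and the relationship can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions that happen to fit the numbers listed here, not something the record states. The function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(3722809, 3724095)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```

Both results match the Gene Length (1287 bp) and Protein Length (428 aa) fields.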


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00402866
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.034061
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGAAT TCAAACCAGA AGACGAGCTG AAACCCGATC CCAGCGATCG TCGTACTGGT 
CGTTCTCGTC AATCTTCTGA ACGTTCTGAG CGTACTGAAC GTGGCGAACC GCAGATCAAT
TTTGATGATA TTGAACTTGA TGACACTGAC GATCGCCGTC CGACTCGTGC GCAAAAAGAG
CGCAATGATG AGCCGGAAAT CGAAGAAGAA ATTGACGAAT CCGAAGATGA AATCGTGGAT
GAAGAGCGCG TAGAGCGTCG TCCGCGTAAG CGCAAAAAAG CAGCCAGTAA ACCCGCTTCT
CGTCAGTATA TGATGATGGG GGTCGGCATT CTGGTTCTCC TACTGTTGAT CATCGGTATC
GGTTCTGCGC TAAAAGCCCC CTCGACCTCT TCCAGCGATC AAACCGCGTC TGGCGAGAAG
AGTATTGATC TTGCAGGCAA TGCGACCGAT CAGGCGAATG GTGTGCAGCC AGCGCCGGGA
ACCACGTCTG CGGAAAATAC TCAGCAGGAT GTTTCTCTGC CGCCGATCTC TTCTACGCCG
ACTCAAGGGC AAACCCCGGC GGCAACGGAT GGTCAACAAC GTGTTGAAGT GCAGGGTGAC
CTGAACAATG CGCTGACCCA GCCACAAAAT CAGCAACAGT TGAATAATGT GGCGGTCAAT
TCCACGTTGC CGACTGAACC CGCGACGGTC GCGCCTGTTC GCAATGGCAA TGCATCGCGT
GACACGGCGA AAACGCAAAC CGCTGAACGT CCGGCCACTA CGCGTCCAGC TCGCCAGCAG
GCGGTGATTG AACCGAAAAA ACCGCAAGCA ACCGTGAAAA CGGAGCCGAA GCCGGTAGCA
CAGACGCCGA AGCGTACTGA ACCAGCTGCT CCTGTGGCAA GCACGAAGGC ACCGGCTGCG
ACCTCTACGC CAGCACCAAA AGAGACCGCA ACTACGGCTC CAGTACAGAC GGCATCCCCG
GCGCAAACCA CGGCAACACC AGCCGCTGGA GGGAAGACCG CAGGTAATGT GGGTTCGTTG
AAATCGGCAC CGTCCAGCCA TTACACTCTG CAGCTGAGCA GTTCCTCTAA CTACGACAAC
CTGAACGGTT GGGCGAAGAA AGAGAATCTG AAAAACTACG TTGTCTATGA AACGACGCGT
AATGGTCAGC CGTGGTATGT CCTGGTTTCT GGCGTGTACG CTTCGAAAGA AGAGGCGAAA
AAAGCGGTAT CTACATTGCC AGCCGATGTC CAGGCCAAAA ACCCGTGGGC GAAACCGCTG
CGTCAGGTAC AGGCCGATCT GAAGTAA
 
Protein sequence
MDEFKPEDEL KPDPSDRRTG RSRQSSERSE RTERGEPQIN FDDIELDDTD DRRPTRAQKE 
RNDEPEIEEE IDESEDEIVD EERVERRPRK RKKAASKPAS RQYMMMGVGI LVLLLLIIGI
GSALKAPSTS SSDQTASGEK SIDLAGNATD QANGVQPAPG TTSAENTQQD VSLPPISSTP
TQGQTPAATD GQQRVEVQGD LNNALTQPQN QQQLNNVAVN STLPTEPATV APVRNGNASR
DTAKTQTAER PATTRPARQQ AVIEPKKPQA TVKTEPKPVA QTPKRTEPAA PVASTKAPAA
TSTPAPKETA TTAPVQTASP AQTTATPAAG GKTAGNVGSL KSAPSSHYTL QLSSSSNYDN
LNGWAKKENL KNYVVYETTR NGQPWYVLVS GVYASKEEAK KAVSTLPADV QAKNPWAKPL
RQVQADLK
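
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch of both calculations in plain Python. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is given in the coding orientation and contains only A, C, G and T. The helper names `clean`, `translate` and `gc_percent` are hypothetical, not part of any tool behind this record.

```python
from itertools import product

# Codon order TCAG x TCAG x TCAG, matching the usual genetic-code layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listed above.
first_line = "ATGGATGAAT TCAAACCAGA AGACGAGCTG AAACCCGATC CCAGCGATCG TCGTACTGGT"
print(translate(first_line))  # MDEFKPEDELKPDPSDRRTG
```

The printed residues agree with the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the Protein Length and GC content fields.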