Gene EcSMS35_2935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2935 
Symbol 
ID6143346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3009559 
End bp3010923 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content51% 
IMG OID641617804 
Producthypothetical protein 
Protein accessionYP_001744959 
Protein GI170680380 
COG category[R] General function prediction only 
COG ID[COG1611] Predicted Rossmann fold nucleotide-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.153502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATTACAC ATATTAGCCC GCTTGGCTCC ATGGATATGT TGTCGCAGCT GGAAGTGGAT 
ATGCTTAAAC GCACCGCCAG CAGCGACCTC TATCAACTGT TTCGCAACTG TTCACTTGCC
GTACTGAACT CCGGTAGTTT GACCGATAAC AGCAAAGAAT TGCTGTCTCG TTTTGAAAAT
TTCGATATTA ACGTCCTGCG CCGTGAACGC GGCGTAAAGC TGGAACTGAT TAATCCCCCG
GAAGAGGCTT TTGTCGATGG GCGAATTATT CGCGCTTTGC AGGCCAACTT GTTCGCGGTT
CTGCGCGACA TTCTCTTCGT TTACGGGCAA ATCCATAACA CCGTTCGTTT TCCCAACCTG
AATCTCGACA ACTCCGTCCA CATCACTAAC CTGGTCTTTT CCATCTTGCG TAACGCTCGC
GCGCTGCATG TTGGTGAAGC GCCAAATATG GTGGTCTGCT GGGGCGGGCA CTCAATTAAC
GAAAACGAGT ATTTGTATGC CCGTCGCGTC GGGAATCAGC TGGGCCTGCG TGAGCTGAAT
ATCTGCACCG GCTGTGGTCC GGGAGCGATG GAAGCGCCAA TGAAAGGTGC TGCGGTCGGA
CACGCACAGC AGCGTTACAA AGACAGTCGT TTTATTGGTA TGACAGAGCC ATCGATTATC
GCCGCTGAAC CGCCTAACCC GCTGGTCAAC GAATTGATCA TCATGCCGGA TATCGAAAAA
CGTCTGGAAG CGTTTGTCCG TATCGCTCAC GGTATCATTA TCTTCCCTGG CGGTGTGGGT
ACGGCAGAAG AGTTGCTGTA TTTGCTGGGA ATTTTAATGA ACCCGGCCAA CAAAGATCAG
GTTTTACCAT TGCTCCTCAC CGGCCCGAAA GAGAGCGCCG ACTACTTCCG CGTACTGGAC
GAGTTTGTCG TGCATACGTT GGGTGAAAAC GCGCGCCGCC ATTACCGCAT CATCATTGAT
GACGCCGCTG AAGTCGCTCG TCAGATGAAA AAATCGATGC CGCTGGTGAA AGAAAATCGC
CGCGACACAG GCGATGCCTA CAGCTTTAAC TGGTCAATGC GCATTGCGCC AGATTTGCAA
ATGCCGTTTG AGCCGTCTCA CGAGAATATG GCTAATCTGA AGCTTTACCC GGATCAACCC
GTTGAAGTGC TGGCTGCCGA CCTGCGCCGT GCGTTCTCCG GTATTGTGGC GGGTAACGTA
AAAGAAGTCG GTATTCGCGC CATTGAAGAG TTTGGTCCTT ATAAAATCAA CGGCGATAAA
GAGATTATGC GTCGTATGGA TGACCTGCTA CAGGGTTTTG TTGCCCAGCA TCGTATGAAG
TTGCCAGGCT CAGCCTACAT CCCTTGCTAC GAAATCTGCA CGTAA
 
Protein sequence
MITHISPLGS MDMLSQLEVD MLKRTASSDL YQLFRNCSLA VLNSGSLTDN SKELLSRFEN 
FDINVLRRER GVKLELINPP EEAFVDGRII RALQANLFAV LRDILFVYGQ IHNTVRFPNL
NLDNSVHITN LVFSILRNAR ALHVGEAPNM VVCWGGHSIN ENEYLYARRV GNQLGLRELN
ICTGCGPGAM EAPMKGAAVG HAQQRYKDSR FIGMTEPSII AAEPPNPLVN ELIIMPDIEK
RLEAFVRIAH GIIIFPGGVG TAEELLYLLG ILMNPANKDQ VLPLLLTGPK ESADYFRVLD
EFVVHTLGEN ARRHYRIIID DAAEVARQMK KSMPLVKENR RDTGDAYSFN WSMRIAPDLQ
MPFEPSHENM ANLKLYPDQP VEVLAADLRR AFSGIVAGNV KEVGIRAIEE FGPYKINGDK
EIMRRMDDLL QGFVAQHRMK LPGSAYIPCY EICT