Gene EcSMS35_2322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2322 
Symbol 
ID6144317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2353487 
End bp2354473 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content55% 
IMG OID641617196 
ProductCobW/P47K family protein 
Protein accessionYP_001744369 
Protein GI170681990 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.366233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00913363 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCAGAA CCAACCTCAT CACCGGTTTT CTCGGCAGCG GGAAAACTAC GTCGATTCTT 
CATCTGTTAG CCCATAAAGA TCCCAACGAA AAATGGGCGG TACTGGTTAA TGAATTTGGG
GAAGTCGGAA TTGATGGTGC TTTGCTCGCC GATAGCGGCG CATTGTTGAA AGAGATCCCC
GGCGGCTGTA TGTGCTGCGT TAATGGTTTA CCCATGCAGG TAGGGTTGAA TACCTTACTG
CGTCAGGGAA AACCAGACCG CTTGTTGATA GAGCCGACCG GGCTGGGCCA TCCGAAACAG
ATCCTCGATC TCTTAACCGC GCCCGTGTAT GAACCGTGGA TAGATCTGCG CGCCACGTTG
TGCATTCTCG ATCCACGCCT GCTGCTGGAC GAAAAAAGCG CCAGCAATGA AAACTTCCGT
GACCAGCTGG CTGCCGCAGA CATTATTGTC GCCAATAAAT CCGACCGTGC GACGCCCGAA
AGTGAGCAAG CGCTACAGCG CTGGTGGCAG CAAAATGGTG GCGATCGGCA ATTAATTCAC
AGCGCGCATG GGAAAGTTGA CGGTCATCTT CTGGATTTGC CGCGTCGCAA TTTAGCCGAG
TTGCCCGCCA GCGCCGCGCA TTCTCATCAG CATGTCGTGA AAAAAGGGTT AGCAGCGTTA
AGCCTGCCAG AGCATCAACG CTGGCGTCGC AGTCTGAACA GCGGGCAAGG ATATCAGGCC
TGCGGCTGGA TATTCGACGC TGATACGGTA TTCGACACCA TTGGCATTCT GGAATGGGCG
CGACTTGCAC CGGTAGAACG CGTCAAAGGC GTGCTGCGTA TTCCCGAAGG GCTGGTGCGA
ATCAACCGTC AGGGCGATGA CCTGCACATT GAAACGCAAA ACGTTGCGCC ACCGGACAGC
CGTATTGAGC TGATTTCCAG CAGCGAAGCT GACTGGAATG CCTTGCAGAG CGCGCTGTTG
AAGCTTCGTT TAGCGACTAC CGCGTAA
 
Protein sequence
MTRTNLITGF LGSGKTTSIL HLLAHKDPNE KWAVLVNEFG EVGIDGALLA DSGALLKEIP 
GGCMCCVNGL PMQVGLNTLL RQGKPDRLLI EPTGLGHPKQ ILDLLTAPVY EPWIDLRATL
CILDPRLLLD EKSASNENFR DQLAAADIIV ANKSDRATPE SEQALQRWWQ QNGGDRQLIH
SAHGKVDGHL LDLPRRNLAE LPASAAHSHQ HVVKKGLAAL SLPEHQRWRR SLNSGQGYQA
CGWIFDADTV FDTIGILEWA RLAPVERVKG VLRIPEGLVR INRQGDDLHI ETQNVAPPDS
RIELISSSEA DWNALQSALL KLRLATTA