Gene EcSMS35_3479 details

Gene Information

Locus tag: EcSMS35_3479
Symbol: obgE
ID: 6145638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3557194
End bp: 3558366
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 53%
IMG OID: 641618308
Product: GTPase ObgE
Protein accession: YP_001745455
Protein GI: 170683110
COG category: [R] General function prediction only
COG ID: [COG0536] Predicted GTPase
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR02729] Obg family GTPase CgtA
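
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3557194
end_bp = 3558366
gene_length_bp = 1173
protein_length_aa = 390

# Assumption: coordinates are 1-based and inclusive on the replicon.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)  # 1173 391 0 390
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.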


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000528393
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAGTTTG TTGATGAAGC ATCGATTCTG GTCGTTGCAG GTGATGGCGG TAATGGTTGC 
GTGAGCTTCC GCCGCGAAAA GTATATTCCG AAAGGCGGCC CGGATGGCGG CGACGGCGGT
GACGGCGGTG ACGTATGGAT GGAAGCCGAC GAGAACCTGA ACACGCTTAT CGATTATCGT
TTTGAAAAAT CTTTCCGCGC AGAGCGTGGT CAGAATGGCG CAAGCCGCGA CTGTACCGGT
AAGCGCGGTA AAGACGTGAC GATTAAAGTG CCGGTAGGTA CGCGTGTTAT CGACCAGGGT
ACCGGTGAAA CCATGGGCGA TATGACCAAA CACGGTCAGC GTCTGCTGGT TGCTAAGGGC
GGCTGGCACG GTCTGGGCAA TACCCGTTTC AAATCGTCAG TTAACCGTAC ACCACGGCAG
AAAACTAACG GTACGCCGGG CGATAAGCGT GAGCTGCTGC TGGAGCTGAT GCTGCTGGCT
GACGTCGGTA TGTTGGGGAT GCCAAACGCG GGTAAATCGA CCTTTATTCG TGCGGTATCG
GCGGCTAAAC CGAAAGTGGC GGATTATCCG TTTACCACTC TGGTGCCAAG TCTTGGTGTG
GTACGAATGG ACAACGAAAA GAGCTTCGTT GTTGCCGATA TTCCAGGACT GATTGAAGGC
GCTGCGGAAG GCGCTGGCCT GGGCATTCGC TTCCTGAAGC ACCTGGAGCG TTGCCGCGTT
CTGTTGCACC TCATCGATAT CGATCCGATT GACGGCACCG ATCCGGTTGA AAACGCGCGT
ATTATTATCA GCGAGCTGGA AAAATACAGC CAGGATCTGG CGGCGAAACC GCGTTGGTTA
GTCTTCAACA AGATCGATCT GCTGGATAAG GCAGAAGCCG AAGAGAAAGC GAAAGCGATC
GCTGAAGCGC TGGGCTGGGA AGATAAATAT TATCTGATCT CTGCGGCGAG CGGACTGGGC
GTGAAAGATC TCTGCTGGGA TGTGATGACC TTTATCATTG AAAACCCGGT CGTGCAGGCT
GAAGAAGCGA AACAGCCAGA GAAAGTCGAA TTCATGTGGG ATGATTATCA CCGTCAGCAG
CTTGAAGAGA TTGCTGAAGA GGATGATGAA GACTGGGATG ACGACTGGGA CGAAGACGAC
GAAGAAGGCG TTGAGTTCAT TTACAAGCGT TAA
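
The GC content reported for this gene is presumably the share of G and C bases in the gene sequence above, although the record does not state how it was computed. A minimal sketch of that calculation follows. The function name `gc_fraction` is hypothetical, and the demo input is only the first row of the sequence, so its result is not expected to equal the figure for the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First row of the gene sequence above, used only as a short demo input.
first_row = "ATGAAGTTTG TTGATGAAGC ATCGATTCTG GTCGTTGCAG GTGATGGCGG TAATGGTTGC"
print(f"{gc_fraction(first_row):.1%}")
```

To check the reported value, pass the full gene sequence (all rows joined) to the same function.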
 
Protein sequence
MKFVDEASIL VVAGDGGNGC VSFRREKYIP KGGPDGGDGG DGGDVWMEAD ENLNTLIDYR 
FEKSFRAERG QNGASRDCTG KRGKDVTIKV PVGTRVIDQG TGETMGDMTK HGQRLLVAKG
GWHGLGNTRF KSSVNRTPRQ KTNGTPGDKR ELLLELMLLA DVGMLGMPNA GKSTFIRAVS
AAKPKVADYP FTTLVPSLGV VRMDNEKSFV VADIPGLIEG AAEGAGLGIR FLKHLERCRV
LLHLIDIDPI DGTDPVENAR IIISELEKYS QDLAAKPRWL VFNKIDLLDK AEAEEKAKAI
AEALGWEDKY YLISAASGLG VKDLCWDVMT FIIENPVVQA EEAKQPEKVE FMWDDYHRQQ
LEEIAEEDDE DWDDDWDEDD EEGVEFIYKR
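
The protein sequence above should follow from the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard 64-letter amino-acid string. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted. The `translate` helper is hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
# Translation table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases and last 33 bases of the gene sequence above.
print(translate("ATGAAGTTTG TTGATGAAGC ATCGATTCTG"))      # MKFVDEASIL
print(translate("GAAGAAGGCG TTGAGTTCAT TTACAAGCGT TAA"))  # EEGVEFIYKR*
```

Both fragments agree with the start and end of the listed protein sequence, with the trailing `*` standing for the TAA stop codon.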