Gene ECH74115_5177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5177 
SymbolgidA 
ID6970499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4820283 
End bp4822172 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content54% 
IMG OID643388843 
ProducttRNA uridine 5-carboxymethylaminomethyl modification enzyme GidA 
Protein accessionYP_002273269 
Protein GI209400287 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG0445] NAD/FAD-utilizing enzyme apparently involved in cell division 
TIGRFAM ID[TIGR00136] glucose-inhibited division protein A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00356313 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.05099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTATC CGGATCCTTT TGACGTCATC ATCATTGGCG GGGGTCATGC AGGCACCGAG 
GCCGCGATGG CTGCGGCGCG TATGGGTCAA CAGACTCTGC TTTTGACACA CAATATCGAC
ACTCTGGGGC AGATGAGCTG CAACCCGGCG ATCGGCGGTA TTGGGAAGGG ACATCTGGTA
AAAGAAGTGG ATGCACTCGG CGGTCTGATG GCGAAAGCGA TCGATCAGGC GGGTATCCAG
TTTAGGATAC TAAACGCAAG CAAGGGACCG GCAGTTCGCG CTACCCGAGC TCAGGCGGAT
CGTGTGCTCT ACCGTCAGGC GGTACGTACG GCGCTGGAGA ACCAACCGAA CCTGATGATC
TTCCAGCAGG CGGTTGAAGA TCTTATTGTC GAAAACGATC GTGTGGTCGG TGCTGTTACC
CAAATGGGAC TGAAGTTCCG TGCCAAAGCC GTCGTGCTCA CCGTTGGGAC GTTCCTCGAC
GGTAAAATTC ATATCGGTCT GGATAACTAC AGCGGTGGCC GTGCTGGTGA TCCGCCGTCC
ATTCCGCTTT CTCGCCGTTT GCGTGAACTT CCGCTGCGCG TTGGTCGTCT GAAAACCGGG
ACACCACCGC GTATTGATGC TCGAACCATC GACTTTAGCG TGCTTGCGCA ACAGCATGGC
GATAACCCAA TGCCGGTATT CTCTTTTATG GGCAATGCGT CCCAGCATCC ACAGCAGGTG
CCGTGTTATA TCACCCATAC CAACGAGAAA ACCCATGATG TGATCCGCAG TAACCTCGAT
CGTAGCCCAA TGTACGCAGG GGTGATCGAA GGTGTCGGCC CACGCTACTG CCCGTCGATC
GAAGACAAAG TCATGCGCTT CGCCGACAGA AATCAGCATC AGATCTTCCT TGAACCGGAA
GGGCTGACTT CTAACGAAAT TTATCCGAAC GGTATCTCCA CCAGCCTGCC GTTCGATGTG
CAGATGCAAA TCGTCCGCTC TATGCAGGGG ATGGAAAACG CGAAGATCGT GCGTCCGGGT
TATGCCATTG AGTATGACTT CTTCGATCCT CGCGACCTGA AACCGACGCT GGAGAGCAAG
TTTATCCAGG GGCTGTTCTT TGCTGGTCAG ATTAACGGCA CTACCGGTTA CGAAGAAGCC
GCTGCGCAAG GTTTGCTGGC CGGTCTTAAC GCTGCCCGTC TGTCTGATGA CAAAGAAGGT
TGGGCTCCGG CGCGTTCTCA GGCCTATCTC GGCGTACTGG TTGATGACCT GTGCACTTTA
GGAACCAAAG AACCGTATCG TATGTTTACC TCGCGCGCAG AATATCGTCT GATGCTGCGC
GAAGATAATG CGGATCTGCG TTTGACTGAA ATCGGTCGTG AACTGGGCCT GGTGGATGAC
GAACGTTGGG CGCGCTTTAA CGAGAAACTT GAGAATATCG AGCGTGAGCG TCAGCGTCTG
AAATCGACCT GGGTAACCCC GTCGGCGGAA GCTGCAGCCG AAGTGAATGC TCACCTGACT
GCGCCGCTTT CCCGTGAAGC CAGTGGTGAA GATCTGCTGC GTCGTCCGGA AATGACTTAT
GAAAAATTAA CCACGCTGAC GCCGTTTGCC CCTGCGTTGA CAGACGAACA GGCGGCGGAA
CAGGTTGAGA TTCAGGTTAA ATACGAAGGT TATATCGCGC GCCAGCAAGA TGAGATCGAA
AAGCAGCTGC GTAACGAGAA CACCTTGCTA CCCGCGACAC TGGATTACCG CCAGGTATCC
GGTCTTTCTA ACGAAGTGAT CGCCAAACTT AACGATCACA AACCGGCCTC TATCGGCCAG
GCTTCACGTA TTTCTGGCGT CACGCCTGCG GCCATCTCCA TTCTGCTGGT GTGGCTGAAA
AAACAGGGTA TGCTGCGTCG TAGCGCATAA
 
Protein sequence
MFYPDPFDVI IIGGGHAGTE AAMAAARMGQ QTLLLTHNID TLGQMSCNPA IGGIGKGHLV 
KEVDALGGLM AKAIDQAGIQ FRILNASKGP AVRATRAQAD RVLYRQAVRT ALENQPNLMI
FQQAVEDLIV ENDRVVGAVT QMGLKFRAKA VVLTVGTFLD GKIHIGLDNY SGGRAGDPPS
IPLSRRLREL PLRVGRLKTG TPPRIDARTI DFSVLAQQHG DNPMPVFSFM GNASQHPQQV
PCYITHTNEK THDVIRSNLD RSPMYAGVIE GVGPRYCPSI EDKVMRFADR NQHQIFLEPE
GLTSNEIYPN GISTSLPFDV QMQIVRSMQG MENAKIVRPG YAIEYDFFDP RDLKPTLESK
FIQGLFFAGQ INGTTGYEEA AAQGLLAGLN AARLSDDKEG WAPARSQAYL GVLVDDLCTL
GTKEPYRMFT SRAEYRLMLR EDNADLRLTE IGRELGLVDD ERWARFNEKL ENIERERQRL
KSTWVTPSAE AAAEVNAHLT APLSREASGE DLLRRPEMTY EKLTTLTPFA PALTDEQAAE
QVEIQVKYEG YIARQQDEIE KQLRNENTLL PATLDYRQVS GLSNEVIAKL NDHKPASIGQ
ASRISGVTPA AISILLVWLK KQGMLRRSA