Gene EcSMS35_4109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4109 
SymbolgidA 
ID6146368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4203543 
End bp4205432 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content55% 
IMG OID641618933 
ProducttRNA uridine 5-carboxymethylaminomethyl modification enzyme GidA 
Protein accessionYP_001746071 
Protein GI170680590 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG0445] NAD/FAD-utilizing enzyme apparently involved in cell division 
TIGRFAM ID[TIGR00136] glucose-inhibited division protein A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0017161 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.124027 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTATC CGGATCCTTT TGACGTCATC ATCATTGGCG GGGGTCATGC AGGCACCGAG 
GCCGCGATGG CCGCGGCGCG TATGGGTCAA CAGACTCTGC TTTTGACACA CAATATCGAC
ACTCTGGGGC AGATGAGCTG CAACCCGGCG ATCGGCGGTA TTGGGAAGGG ACATCTGGTA
AAAGAAGTGG ATGCACTCGG CGGTCTGATG GCGAAAGCGA TCGATCTGGC GGGTATCCAG
TTTAGGATAC TAAACGCAAG TAAGGGACCG GCAGTTCGCG CTACCCGAGC TCAGGCGGAT
CGTGTGCTCT ACCGGCAGGC GGTACGTACG GCGCTGGAGA ACCAACCGAA CCTGATGATC
TTCCAGCAGG CAGTTGAAGA TCTTATTGTC GAAAACGATC GCGTGGTCGG AGCCGTTACC
CAAATGGGAC TGAAGTTCCG TGCCAAAGCT GTCGTGCTCA CCGTTGGGAC GTTCCTCGAC
GGTAAAATTC ATATCGGTCT GGATAACTAC AGCGGTGGCC GTGCTGGTGA TCCGCCGTCC
ATTCCGCTTT CTCGCCGTTT GCGTGAACTG CCGCTGCGCG TTGGTCGTCT GAAAACCGGG
ACACCACCGC GTATTGATGC TCGAACCATC GACTTTAGCG TACTGGCGCA ACAGCATGGC
GATAACCCAA TGCCGGTATT CTCGTTTATG GGCAATGCGT CCCAGCATCC CCAGCAGGTG
CCGTGTTATA TCACTCATAC CAACGAGAAA ACCCATGATG TGATCCGCAG TAACCTCGAT
CGTAGCCCAA TGTACGCAGG GGTGATCGAA GGTGTCGGCC CACGCTACTG CCCGTCGATC
GAAGACAAAG TCATGCGCTT CGCCGACAGA AATCAGCATC AGATCTTCCT TGAACCGGAA
GGGCTGACCT CTAACGAAAT TTATCCGAAC GGTATCTCCA CCAGCCTGCC GTTCGATGTG
CAGATGCAAA TCGTCCGCTC CATGCAGGGG ATGGAAAACG CGAAGATCGT GCGTCCGGGT
TATGCCATTG AGTATGACTT CTTCGATCCA CGCGACCTGA AACCGACGCT GGAGAGCAAG
TTTATCCAGG GGCTGTTCTT TGCTGGTCAG ATTAACGGCA CTACCGGTTA CGAAGAAGCC
GCTGCGCAAG GTTTGCTGGC TGGTCTTAAC GCTGCCCGTC TGTCTGCAGA CAAAGAAGGT
TGGGCTCCGG CGCGTTCTCA GGCGTATCTC GGCGTACTGG TTGATGACCT GTGCACTTTA
GGAACCAAAG AACCGTATCG TATGTTTACC TCGCGCGCAG AATATCGTCT GATGCTGCGC
GAAGATAATG CGGATCTGCG TTTGACTGAA ATCGGTCGTG AACTGGGCCT GGTGGATGAC
GAACGTTGGG CGCGCTTTAA CGAGAAACTT GAGAATATCG AGCGTGAGCG TCAGCGTCTG
AAATCGACCT GGGTAACCCC GTCGGCGGAA GCTGCAGCCG AAGTGAATGC TCACCTGACT
GCGCCACTTT CCCGTGAAGC CAGTGGTGAA GATCTGCTGC GTCGTCCGGA AATGACTTAT
GAAAAATTAA CCACGCTGAC GCCGTTTGCC CCTGCGTTGA CAGACGAACA GGCGGCGGAA
CAGGTTGAGA TTCAGGTTAA ATACGAAGGT TATATCGCGC GCCAGCAAGA TGAGATCGAA
AAGCAGCTGC GTAACGAGAA CACCCTGCTA CCAGCGACGC TGGATTACCG CCAGGTATCC
GGTCTTTCTA ACGAAGTGAT CGCCAAACTT AACGATCACA AACCGGCCTC TATCGGTCAG
GCTTCGCGTA TTTCTGGCGT CACGCCTGCG GCCATCTCCA TTCTGCTGGT GTGGCTGAAA
AAACAGGGTA TGCTGCGTCG TAGCGCATAA
 
Protein sequence
MFYPDPFDVI IIGGGHAGTE AAMAAARMGQ QTLLLTHNID TLGQMSCNPA IGGIGKGHLV 
KEVDALGGLM AKAIDLAGIQ FRILNASKGP AVRATRAQAD RVLYRQAVRT ALENQPNLMI
FQQAVEDLIV ENDRVVGAVT QMGLKFRAKA VVLTVGTFLD GKIHIGLDNY SGGRAGDPPS
IPLSRRLREL PLRVGRLKTG TPPRIDARTI DFSVLAQQHG DNPMPVFSFM GNASQHPQQV
PCYITHTNEK THDVIRSNLD RSPMYAGVIE GVGPRYCPSI EDKVMRFADR NQHQIFLEPE
GLTSNEIYPN GISTSLPFDV QMQIVRSMQG MENAKIVRPG YAIEYDFFDP RDLKPTLESK
FIQGLFFAGQ INGTTGYEEA AAQGLLAGLN AARLSADKEG WAPARSQAYL GVLVDDLCTL
GTKEPYRMFT SRAEYRLMLR EDNADLRLTE IGRELGLVDD ERWARFNEKL ENIERERQRL
KSTWVTPSAE AAAEVNAHLT APLSREASGE DLLRRPEMTY EKLTTLTPFA PALTDEQAAE
QVEIQVKYEG YIARQQDEIE KQLRNENTLL PATLDYRQVS GLSNEVIAKL NDHKPASIGQ
ASRISGVTPA AISILLVWLK KQGMLRRSA