Gene EcSMS35_4067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4067 
SymboldnaA 
ID6146687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4158994 
End bp4160397 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content54% 
IMG OID641618892 
Productchromosomal replication initiation protein 
Protein accessionYP_001746030 
Protein GI170679718 
COG category[L] Replication, recombination and repair 
COG ID[COG0593] ATPase involved in DNA replication initiation 
TIGRFAM ID[TIGR00362] chromosomal replication initiator protein DnaA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000102031 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCACTTT CGCTTTGGCA GCAGTGTCTT GCCCGATTGC AGGATGAGTT ACCAGCCACA 
GAATTCAGTA TGTGGATACG CCCATTGCAG GCGGAACTGA GCGATAACAC GCTGGCCCTG
TACGCGCCAA ACCGTTTTGT CCTCGATTGG GTACGGGACA AGTACCTTAA TAATATCAAT
GGACTGCTAA CCAGTTTCTG CGGAGCGGAT GCCCCACAGC TGCGTTTTGA AGTCGGCACC
AAACCGGTGA CGCAAACGCC ACAAGCGGCA GTGACGAGCA ACGTCGCGGC CCCTGCACAG
GTGGCGCAAA CGCAGCCGCA ACGTGCTGCG CCTTCTACGC GCTCGGGTTG GGATAACGTC
CCGGCTCCGG CAGAACCGAC CTATCGTTCT AACGTAAACG TCAAACACAC GTTTGATAAC
TTCGTTGAAG GTAAATCTAA CCAACTGGCG CGCGCGGCGG CTCGCCAGGT GGCAGATAAC
CCTGGCGGTG CTTATAACCC GTTGTTCCTT TATGGCGGCA CGGGTTTGGG TAAAACTCAC
CTGCTGCATG CGGTGGGTAA CGGCATTATG GCGCGCAAGC CGAATGCCAA AGTGGTTTAT
ATGCACTCCG AGCGCTTTGT TCAGGACATG GTTAAAGCCC TGCAAAACAA CGCGATCGAA
GAGTTTAAAC GCTACTACCG TTCCGTAGAT GCACTGCTGA TCGACGATAT TCAGTTTTTT
GCTAATAAAG AACGATCTCA GGAAGAGTTT TTCCACACCT TCAACGCCCT GCTGGAAGGT
AATCAACAGA TCATTCTCAC CTCCGATCGC TATCCGAAAG AGATCAACGG CGTTGAGGAT
CGTTTGAAAT CCCGCTTCGG CTGGGGACTG ACTGTGGCGA TCGAACCGCC AGAGCTGGAA
ACCCGCGTGG CGATCCTGAT GAAAAAGGCC GACGAAAACG ACATTCGTTT GCCGGGCGAA
GTGGCGTTCT TTATCGCCAA GCGTCTACGA TCTAACGTAC GTGAGCTGGA AGGCGCGCTG
AACCGCGTTA TTGCTAACGC CAACTTTACC GGACGTGCGA TCACCATCGA CTTCGTGCGT
GAGGCGCTGC GCGACTTGCT GGCATTGCAG GAAAAACTGG TCACCATCGA CAATATTCAG
AAGACGGTGG CGGAGTACTA CAAGATCAAA GTCGCGGATC TCCTTTCCAA GCGTCGATCC
CGCTCGGTGG CGCGTCCGCG CCAGATGGCG ATGGCGCTGG CAAAAGAACT GACTAACCAC
AGTCTGCCGG AGATTGGCGA TGCGTTTGGT GGCCGTGACC ACACGACGGT GCTTCATGCC
TGCCGTAAGA TCGAGCAGCT GCGTGAAGAG AGCCACGATA TCAAAGAAGA TTTTTCCAAT
TTAATCAGAA CATTGTCATC GTAA
 
Protein sequence
MSLSLWQQCL ARLQDELPAT EFSMWIRPLQ AELSDNTLAL YAPNRFVLDW VRDKYLNNIN 
GLLTSFCGAD APQLRFEVGT KPVTQTPQAA VTSNVAAPAQ VAQTQPQRAA PSTRSGWDNV
PAPAEPTYRS NVNVKHTFDN FVEGKSNQLA RAAARQVADN PGGAYNPLFL YGGTGLGKTH
LLHAVGNGIM ARKPNAKVVY MHSERFVQDM VKALQNNAIE EFKRYYRSVD ALLIDDIQFF
ANKERSQEEF FHTFNALLEG NQQIILTSDR YPKEINGVED RLKSRFGWGL TVAIEPPELE
TRVAILMKKA DENDIRLPGE VAFFIAKRLR SNVRELEGAL NRVIANANFT GRAITIDFVR
EALRDLLALQ EKLVTIDNIQ KTVAEYYKIK VADLLSKRRS RSVARPRQMA MALAKELTNH
SLPEIGDAFG GRDHTTVLHA CRKIEQLREE SHDIKEDFSN LIRTLSS