Gene EcSMS35_3528 details

Gene Information

Locus tag: EcSMS35_3528
Symbol:
ID: 6145973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3605795
End bp: 3606922
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 52%
IMG OID: 641618357
Product: AFG1 family ATPase
Protein accession: YP_001745504
Protein GI: 170683950
COG category: [R] General function prediction only
COG ID: [COG1485] Predicted ATPase
TIGRFAM ID:
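
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the reported gene length fits that reading) and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3605795
end_bp = 3606922

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1128, matches "Gene Length"
print(protein_length)  # 375, matches "Protein Length"
```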


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000142323
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAGCG TTACCCCAAC ATCGCAATAC CTGAAGGCGC TCAATGAAGG CAGCCATCAA 
CCCGACGACG TTCAAAAAGA GGCCGTCAGC CGCCTGGAAA TTATTTATCA GGAACTCATC
AATAGCACGC CTCCAGCCCC TAGGACGAGT GGGCTAATGG CGCGGGTCGG TAAGCTGTGG
AGTAAACGCG AAGACACAAA GCATACGCCA GTGCGTGGCT TATATATGTG GGGCGGTGTA
GGACGCGGGA AAACCTGGCT GATGGACCTT TTCTATCAAA GCCTGCCGGG AGAGCGGAAA
CAGCGTCTGC ACTTTCACCG TTTTATGCTG CGGGTGCACG AAGAGCTGAC CGAATTACAA
GGCCAAAGCG ATCCGCTGGA AATTATTGCC GATCGCTTTA AAGCAGAAAC TGACGTGCTC
TGTTTTGACG AATTTTTTGT TTCTGATATT ACCGATGCCA TGCTACTTGG CGGTCTGATG
AAAGCCCTGT TCGCCCGAGG TATTACCCTG GTAGCGACGT CAAATATTCC GCCGGACGAA
CTTTATCGAA ATGGCCTACA ACGTGCGCGT TTTTTGCCTG CAATCGATGC CATTAAACAG
CATTGTGATG TAATGAACGT GGACGCTGGT GTTGATTATC GACTGCGTAC ACTCACTCAG
GCGCATCTGT GGCTTTCGCC CCTCAACGAT GAAACCCGGG CGCAAATGGA TAAACTATGG
TTGGCGCTGG CGGGGGCGAA ACGAGAAAAT TCACCGACAT TAGAAATCAA CCATCGGCCA
TTGGCGACAA TGGGCGTCGA GAACCAGACG CTGGCGGTCT CTTTTACTAC GCTGTGCGTC
GACGCCCGCA GTCAGCATGA CTATATTGCG CTCTCACGCC TCTTTCACAC GGTCATGTTG
TTTGATGTAC CAGTTATGAC GCGGTTGATG GAGAGCGAAG CGCGGCGCTT TATTGCGCTG
GTGGATGAGT TTTACGAGCG CCATGTCAAA TTAGTGGTGA GTGCAGAAGT GCCGCTATAT
GACATTTATC AGGGCGAGCG GCTGAAATTT GAGTTCCAGC GTTGCCTGTC ACGTCTGCAA
GAGATGCAAA GCGAAGAGTA TCTGAAGCGC GAGCATTTAG CGGGTTAA
 
Protein sequence
MQSVTPTSQY LKALNEGSHQ PDDVQKEAVS RLEIIYQELI NSTPPAPRTS GLMARVGKLW 
SKREDTKHTP VRGLYMWGGV GRGKTWLMDL FYQSLPGERK QRLHFHRFML RVHEELTELQ
GQSDPLEIIA DRFKAETDVL CFDEFFVSDI TDAMLLGGLM KALFARGITL VATSNIPPDE
LYRNGLQRAR FLPAIDAIKQ HCDVMNVDAG VDYRLRTLTQ AHLWLSPLND ETRAQMDKLW
LALAGAKREN SPTLEINHRP LATMGVENQT LAVSFTTLCV DARSQHDYIA LSRLFHTVML
FDVPVMTRLM ESEARRFIAL VDEFYERHVK LVVSAEVPLY DIYQGERLKF EFQRCLSRLQ
EMQSEEYLKR EHLAG
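
The gene and protein sequences can be checked against each other, and against the reported GC content, with a few lines of Python. This is a minimal sketch, not the pipeline that produced this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard code. It does not handle alternative start codons such as GTG or TTG, which does not matter here because the gene starts with ATG.

```python
# Codon table in the conventional TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGCAAAGCG TTACCCCAAC ATCGCAATAC"))  # MQSVTPTSQY
```

Pasting the full gene sequence into `translate` and `gc_content` gives values to compare with the protein sequence and the GC content field listed in this record.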