Gene EcSMS35_0794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0794 
Symbol 
ID6143163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp795514 
End bp797775 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content51% 
IMG OID641615682 
Producthypothetical protein 
Protein accessionYP_001742874 
Protein GI170683967 
COG category[C] Energy production and conversion 
COG ID[COG1048] Aconitase A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGT TATCTGAAAA AGGCGTGTTT CTCGCCAGTA ATAACGAAAT AATTGCCGAA 
GAACATTTCA CCGGTGAAAT TAAAAAAGAA GAAGCCAAAA AAGGCACTAT TGCCTGGTCT
ATTCTCTCTT CTCATAATAC GTCTGGTAAT ATGGATAAAC TTAAAATTAA GTTTGATTCA
TTAGCCTCTC ACGATATTAC CTTTGTTGGT ATTGTACAGA CCGCTAAAGC GTCCGGTATG
GAACGTTTCC CGCTGCCGTA TGTGCTGACC AACTGCCATA ACTCACTCTG CGCCGTCGGC
GGTACTATTA ATGGCGATGA CCATATTTTT GGCTTATCGG CAGCGCAACG TTATGGCGGT
ATTTTTGTGC CACCGCATAT TGCGGTCATC CATCAATATA TGCGTGAAAT GATGGCGGGC
GGCGGCAAAA TGATTCTCGG GTCAGACAGC CATACCCGTT ACGGTGCATT AGGGACAATG
GCAGTCGGTG AGGGTGGCGG TGAGTTGGTA AAACAGCTGC TTAATGACAC CTGGGATATC
GACTATCCGG GAGTTGTTGC GGTGCATTTG ACCGGAAAAC CTGCGCCGTA TGTGGGGCCA
CAGGATGTTG CGCTGGCTAT CATCGGTGCC GTGTTCAAAA ACGGCTACGT CAAAAACAAA
GTGATGGAAT TCGTAGGTCC CGGTGTTGCT GCGCTCTCTA CCGATTTCCG TAACAGCGTT
GACGTGATGA CCACTGAAAC GACCTGTTTA AGTTCTGTCT GGCAAACCGA TGAAGAAGTC
CATAACTGGC TGGCGCTGCA CGGTCGCGGC CAGGATTACT GCCAGCTTAA CCCTCAACCG
ATGGCGTACT ACGATGGCTG CATCAGCGTT GATTTAAGCG CCATCAAACC AATGATTGCG
CTGCCGTTTC ACCCGAGCAA CGTGTATGAA ATCGACACGC TGAACCAGAA CCTGACCGAC
ATTCTGCGTG AGATTGAAAT TGAGTCCGAG CGCGTGGCGC ACGGTAAAGC CAAACTCTCG
CTGCTGGATA AAGTCGAAAA CGGTCGCCTG AAAGTGCAGC AGGGGATTAT CGCGGGCTGT
TCTGGCGGTA ACTACGAAAA CGTCATCGCG GCGGCGAATG CACTGCGCGG TCAATCCTGT
GGCAATGACA CCTTCTCGTT AGCGGTTTAC CCGTCATCAC AGCCCGTGTT TATGGATCTC
GCCAAAAAAG GTGTGGTAGC AGATTTGATT GGCGCAGGCG CAATCATCAG AACCGCGTTC
TGCGGCCCAT GCTTTGGCGC GGGCGATACA CCAATCAATA ACGGTTTGAG TATTCGCCAC
ACCACGCGTA ACTTCCCGAA CCGCGAAGGC TCTAAGCCAG CTAATGGGCA GATGTCAGCG
GTGGCGTTGA TGGACGCTCG TTCTATCGCT GCGACTGCGG CAAACGGTGG CTATTTAACC
TCTGCCAGCG AGCTGGATTG CTGGGACAAC GTGCCGGAGT ACGCCTTCGA TGTAACGCCG
TATAAAAACC GTGTTTATCA GGGCTTTGTG AAAGGGGCGA CCCAGCAACC GCTGATTTAC
GGACCGAACA TTAAAGACTG GCCAGAATTG GGTGCGCTGA CTGACAATAT CGTCCTCAAA
GTGTGCTCGA AGATCCTCGA CGAAGTGACC ACCACCGACG AACTGATTCC TTCCGGTGAA
ACCTCTTCAT ATCGTTCAAA TCCGATTGGT CTGGCGGAGT TTACTCTGTC ACGCCGCGAT
CCAGGTTATG TTGGCAGAAG TAAAGCGACT GCCGAGCTGG AAAATCAGCG TCTGGCGGGA
AATGTCAGCG AGCTGACAGA GGTGTTTGCG CGCATTAAGC AGATTGCTGG TCAGGAGCAT
ATTGATCCGC TGCAAACTGA AATTGGCAGT ATGGTCTATG CGGTGAAACC AGGCGATGGT
TCTGCGCGTG AACAGGCGGC GAGTTGTCAG CGTGTGATTG GCGGTCTGGC GAATATTGCC
GAGGAGTACG CGACTAAACG CTATCGTTCT AACGTCATCA ACTGGGGGAT GTTACCGCTG
CAGATGGCGG AAGCGCCAAC CTTTGAAGTG GGGGATTACA TTTACATCCC TGGCATTAAA
GCGGCACTGG ATAATCCGGG TACGACGTTT AAAGGTTATG TGATCCATGA AGATGCGCCG
GTAACGGAAA TTACGCTTTA CATGGAAAGC CTGACGGCCG AAGAGCGCGA GATTATCAAG
GCGGGTAGTT TGATTAACTT CAATAAAAAC CGTCAGATGT AA
 
Protein sequence
MIKLSEKGVF LASNNEIIAE EHFTGEIKKE EAKKGTIAWS ILSSHNTSGN MDKLKIKFDS 
LASHDITFVG IVQTAKASGM ERFPLPYVLT NCHNSLCAVG GTINGDDHIF GLSAAQRYGG
IFVPPHIAVI HQYMREMMAG GGKMILGSDS HTRYGALGTM AVGEGGGELV KQLLNDTWDI
DYPGVVAVHL TGKPAPYVGP QDVALAIIGA VFKNGYVKNK VMEFVGPGVA ALSTDFRNSV
DVMTTETTCL SSVWQTDEEV HNWLALHGRG QDYCQLNPQP MAYYDGCISV DLSAIKPMIA
LPFHPSNVYE IDTLNQNLTD ILREIEIESE RVAHGKAKLS LLDKVENGRL KVQQGIIAGC
SGGNYENVIA AANALRGQSC GNDTFSLAVY PSSQPVFMDL AKKGVVADLI GAGAIIRTAF
CGPCFGAGDT PINNGLSIRH TTRNFPNREG SKPANGQMSA VALMDARSIA ATAANGGYLT
SASELDCWDN VPEYAFDVTP YKNRVYQGFV KGATQQPLIY GPNIKDWPEL GALTDNIVLK
VCSKILDEVT TTDELIPSGE TSSYRSNPIG LAEFTLSRRD PGYVGRSKAT AELENQRLAG
NVSELTEVFA RIKQIAGQEH IDPLQTEIGS MVYAVKPGDG SAREQAASCQ RVIGGLANIA
EEYATKRYRS NVINWGMLPL QMAEAPTFEV GDYIYIPGIK AALDNPGTTF KGYVIHEDAP
VTEITLYMES LTAEEREIIK AGSLINFNKN RQM