Gene EcSMS35_3513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3513 
Symbol 
ID6144783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3591493 
End bp3593742 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content59% 
IMG OID641618342 
Producthypothetical protein 
Protein accessionYP_001745489 
Protein GI170681434 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAC AAACCGTTCA TTTGTTTGGC ATCCGTCACC ATGGCCCTGG CTGCGCGCGC 
AGCCTGCGAA AGGCGCTGGA GACGCTACAG CCCGACTGCC TGCTGGTGGA AGGGCCGCCC
GACGGCGAAT CCATCCTGCC TTTTATGCAG CATGAACAGA TGCAACCGCC GGTCGCGCTA
CTGATTTATG CCCCGGACGA CTCACATCAC GCCGCGTTTT ACCCCTTTGC GGCGTTCTCC
CCGGAATGGC AGGCGCTGCG CTATGGTTTT GAGCAAAATA TCGCCGTGCG CTTTATGGAC
TTGCCCATCA GTCATCAGTT CGCGCTTGAT GACGAAAAAG AGAGCGATGA CGATGATGCC
GTTGAAGCCA GTCCCGACGG CGATCCGCTG GACTGGCTGG GCCGCGCCGC CGGTTACACC
GATGGGGAAA GCTGGTGGAG CCACAGAGTA GAGGAACGCG AAGATGATTT GTCTCTGTTC
GAGGCGATTC GCGAGGCGAT GATTGCCCTA CGCCAGGCAA CTCCCGAAGC CCGAAACTCA
GCCAGAGATC AACTGCGTGA AGCGTATATG CGCAAGACGC TACGTCAGGC GAAAAAAGAG
GGCTTCTCGC GCATCGCGGT AGTCTGCGGT GCGTGGCACG TTCCGGCGCT GGAAAACCTG
CCGCCCGCAA AGAATGATAA CGAACTGCTG AAAAATTTGC CGAAGTGCAA GGTCGCGGCA
GCCTGGACGC CCTGGAGCTA CGAGGCGTTA AGCCGCGCCA GCGGTTATGG CGCAGGGGTG
GTGTCACCGG AATGGTACGA TCATCTGTGG CGTTATCAGG GCGCTTCTCA CCGCGATATC
GGCTGGCTGT CACGCGCGGC AAGGCTGTTT CGCGAAGCCG ATCTCGACTG TTCCAGCGCC
CACATTATCG AAGCGGCGCG GCTGGCGCAG ACGCTGGCGA TTATGCGCCA TCACCCGCAG
CCGGGGCTTG ACGAACTTTG CGAAGCGCTG CAAACCGTAG TGTGTATGGG CGAAAGCGCG
CCAATGCAGA TGATTCGCCA GAAGTTAATT GTCGGCGACG CGCTGGGAAG CGTACCGGAT
GATACGCCCG TCGTGCCGCT CCAACGCGAC ATTACGCAAC AGCAAAAAAC GCTGCGCCTG
AAGGCAGAAG CCAGCGAAAA AGTGCTGGAT CTTGATCTGC GCAAACCCGG CGATCTGGCG
CGTAGCCATT TACTGCATCG CCTGACCTTA CTCGACATTT CCTGGGGCCG TCTGGCTGGA
CAAGGAAATA ACAGCAAAGG GACGTTCCAC GAAGTCTGGT CGCTGCGCTG GGAACCGGCA
CTGGCAATCA ACATCATTAC CGCCAGCCGT TGGGGTAACA GCATTGAGCA GGCCAGCTCC
CGCTATGCCA TTGTGAGGGC GCAATACGCC AGCACGCTGC CGGAACTGGC AAAGCTTATC
CAACAGGTGC TACTGGCCGA TTTACAGACC GCGATTGCCC CCATCGCCAA TACACTGGAA
TCACTGGCTG CCACTCAGGG CAATATCGAA CAGTTGCTGG AAGCCCTGGC ACCACTGGTG
GCGATTGTCC GCTACGGCAA CGTGCGCCAG ACCGACTCTG GTATGGTAAT GCAGGTACTG
ATGAGCCTCG CCCCGCGCGC CGCCATCGCC CTACCCGGAG CCTGTTCGGC GCTCAACGAC
GACAGCGCCG CCAGTATGAG AGAAAAAGTT ATCGACGCCC ACGCCGCGCT GCGGTTGCTG
GATAACGAAG ATCTATTGGC GGGCTGGTTG CAGGCGTTGA TGGTGCTGGC AGAGGGCAGT
ACCGCACACG CGCTGCTTCG CGGAACGGCG ACGCGCCTGT TGTTTGATCT GCAAACGTTA
ACGACAGAAC GGATCAGCAC GCTGATGAAC CTGGCGCTAT CGCCAGCCAA TCCCCCGGCG
GAAAGTGCCG CATGGGCGGA AGGTTTCCTC AACGACAGCG CGATGGTGCT ACTGCACAAT
AGCGAATTAT GGCAGTTGAT CGATGCCTGG CTAAGCGGCC TGAACGACAA TCACTTCACC
CGGATCCTGC CGATGCTGCG GCGGACATTC GGCCGTTTTT CCAGCCCGGA ACGCCGCCAG
TTAGGCGAAC GCGCCGCGCA GGGCGAGCGT GTCGCGCAGC AAGAAGAGAC CAGCGGTATA
TGGGATGAAC AGCGTGCCGC GTTAATGCTG CCGCTACTGC GCCGCATTCT GGCTTTGCCA
CCACACGAAA GGGAAAATCA TGTCGAGTAA
 
Protein sequence
MTEQTVHLFG IRHHGPGCAR SLRKALETLQ PDCLLVEGPP DGESILPFMQ HEQMQPPVAL 
LIYAPDDSHH AAFYPFAAFS PEWQALRYGF EQNIAVRFMD LPISHQFALD DEKESDDDDA
VEASPDGDPL DWLGRAAGYT DGESWWSHRV EEREDDLSLF EAIREAMIAL RQATPEARNS
ARDQLREAYM RKTLRQAKKE GFSRIAVVCG AWHVPALENL PPAKNDNELL KNLPKCKVAA
AWTPWSYEAL SRASGYGAGV VSPEWYDHLW RYQGASHRDI GWLSRAARLF READLDCSSA
HIIEAARLAQ TLAIMRHHPQ PGLDELCEAL QTVVCMGESA PMQMIRQKLI VGDALGSVPD
DTPVVPLQRD ITQQQKTLRL KAEASEKVLD LDLRKPGDLA RSHLLHRLTL LDISWGRLAG
QGNNSKGTFH EVWSLRWEPA LAINIITASR WGNSIEQASS RYAIVRAQYA STLPELAKLI
QQVLLADLQT AIAPIANTLE SLAATQGNIE QLLEALAPLV AIVRYGNVRQ TDSGMVMQVL
MSLAPRAAIA LPGACSALND DSAASMREKV IDAHAALRLL DNEDLLAGWL QALMVLAEGS
TAHALLRGTA TRLLFDLQTL TTERISTLMN LALSPANPPA ESAAWAEGFL NDSAMVLLHN
SELWQLIDAW LSGLNDNHFT RILPMLRRTF GRFSSPERRQ LGERAAQGER VAQQEETSGI
WDEQRAALML PLLRRILALP PHERENHVE