Gene EcSMS35_4658 details

Gene Information

Locus tag: EcSMS35_4658
Symbol:
ID: 6144078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4759057
End bp: 4760682
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 55%
IMG OID: 641619474
Product: isovaleryl CoA dehydrogenase
Protein accession: YP_001746582
Protein GI: 170680540
COG category: [I] Lipid transport and metabolism
COG ID: [COG1960] Acyl-CoA dehydrogenases
TIGRFAM ID:
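
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp and End bp fields of this record.
START_BP = 4759057
END_BP = 4760682

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1626
print(protein_length_aa(length))  # 541
```

Both results match the Gene Length (1626 bp) and Protein Length (541 aa) fields.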


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.61602
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCACTGGC AAACTCACAC CGTTTTTAAT CAACCTATAC CATTAAATAA CAGCAATTTA 
TACCTGTCTG ATGGCGCGCT CTGCGAAGCG GTAACGCGTG AAGGTGCTGG CTGGGATAGC
GATTTTCTAG CCAGTATTGG TCAGCAGTTA GGGACGGCTG AATCCCTTGA ACTGGGGCGG
CTGGCGAATG TGAATCCGCC TGAATTATTG CGCTACGATG CGCAAGGACG CCGTCTGGAC
GATGTGCGTT TTCACCCCGC CTGGCACCTG CTGATGCAGG CGCTATGTAC CAATCGGGTG
CACAATCTTG CCTGGGAAGA AGACGCTCGC TCCGGCGCAT TTGTGGCGCG CGCGGCGCGT
TTTATGTTAC ATGCGCAGGT TGAGGCTGGG TCGTTATGTC CGATAACGAT GACCTTTGCC
GCCACGCCAT TGCTGTTACA GATGTTACCC GCGCCGTTTC AGGACTGGAC CACGCCGCTG
TTGAGCGATC GCTACGATTC TCACTTATTG CCAGGTGGGC AAAAACGCGG TTTGTTGATT
GGCATGGGAA TGACGGAAAA GCAGGGCGGT TCCGATGTCA TGAGCAACAC CACCCGCGCA
GAGCGCCTGG AAGATGGCTC TTATCGGCTG GTGGGGCATA AATGGTTTTT CTCGGTGCCG
CAAAGCGATG CGCATCTGGT GCTGGCGCAG ACTACGGGCG GTTTGTCCTG CTTTTTTGTG
CCGCGCTTTT TGCCTGACGG GCAACGCAAC GCGATTCGCC TCGAGCGGCT GAAAGATAAG
CTGGGTAATC GCTCTAACGC CAGCTGCGAA GTGGAGTTTC AGGATGCCAT TGGCTGGTTG
TTGGGGCAGG AAGGGGAAGG AATTCGTCTG ATCCTGAAAA TGGGTGGGAT GACGCGTTTT
GATTGCGCCC TGGGTAGCCA TGCCATGATG CGCCGTGCAT TTTCGCTGGC GATTTATCAT
GCACATCAAC GCCATGTTTT TGGTAATCCA TTGATCCAAC AGCCCCTTAT GCGTCATGTC
TTAAGTCGCA TGGCGCTTCA GCTTGAAGGG CAAACGGCGT TGCTGTTTCG TCTTGCGCGA
GCGTGGGACC GGCGTGCCGA TGCCAAAGAA GCTCTGTGGG CGCGTTTATT TACGCCTGCG
GCGAAATTTG TGATCTGCAA ACATGGTATT CCGTTTGTGG CCGAAGCGAT GGAGGTGCTG
GGCGGCATTG GTTATTGCGA GGAGAGCGAG CTGCCGCGGC TTTACCGGGA GATGCCGGTA
AACAGTATTT GGGAAGGTTC CGGCAATATT ATGTGCCTGG ATGTGCTGCG CGTTCTCAAT
AAGCAAGCGG GCGTATACGA CTTATTGTCG GAAGCATTTG TGGAAGTGAA AGGGCAGGAT
CGCTATTTTG ATCGCGCGGT TCGTCGTTTA CAGCAGCAGC TGCGTAAGCC AGCTGAAGAA
TTGGGGCGAG AGATTACTCA TCAGCTATTC CTGCTGGGCT GCGGTGCGCA AATGTTGAAA
TATGCCTCTC CGCCAATGGC GCAGGCGTGG TGTCAGGTGA TGTTAGATAC GCGCGGCGGC
GTACGGTTGT CAGAGCAGAT CCAGAATGAT TTATTGCTGC GGGCGACGGG AGGAGTGTGT
TTGTAA
 
Protein sequence
MHWQTHTVFN QPIPLNNSNL YLSDGALCEA VTREGAGWDS DFLASIGQQL GTAESLELGR 
LANVNPPELL RYDAQGRRLD DVRFHPAWHL LMQALCTNRV HNLAWEEDAR SGAFVARAAR
FMLHAQVEAG SLCPITMTFA ATPLLLQMLP APFQDWTTPL LSDRYDSHLL PGGQKRGLLI
GMGMTEKQGG SDVMSNTTRA ERLEDGSYRL VGHKWFFSVP QSDAHLVLAQ TTGGLSCFFV
PRFLPDGQRN AIRLERLKDK LGNRSNASCE VEFQDAIGWL LGQEGEGIRL ILKMGGMTRF
DCALGSHAMM RRAFSLAIYH AHQRHVFGNP LIQQPLMRHV LSRMALQLEG QTALLFRLAR
AWDRRADAKE ALWARLFTPA AKFVICKHGI PFVAEAMEVL GGIGYCEESE LPRLYREMPV
NSIWEGSGNI MCLDVLRVLN KQAGVYDLLS EAFVEVKGQD RYFDRAVRRL QQQLRKPAEE
LGREITHQLF LLGCGAQMLK YASPPMAQAW CQVMLDTRGG VRLSEQIQND LLLRATGGVC
L
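
The protein sequence follows from the gene sequence under translation table 11, where an alternative start codon such as GTG is read as methionine at the first position. The sketch below is a minimal, dependency-free illustration and not the database's own tooling. It assumes the standard codon assignments (which table 11 shares with the standard code) and the table 11 start codon set as listed by NCBI. The names `translate_cds`, `CODON_TABLE`, and `TABLE_11_STARTS` are hypothetical.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons permitted by translation table 11 (assumption: per the NCBI listing).
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base groups is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in TABLE_11_STARTS:
            residues.append("M")  # initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":     # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 60 bases of the gene sequence above.
first_line = "GTGCACTGGC AAACTCACAC CGTTTTTAAT CAACCTATAC CATTAAATAA CAGCAATTTA"
print(translate_cds(first_line))  # MHWQTHTVFNQPIPLNNSNL
```

The output matches the first 20 residues of the protein sequence, including the leading M encoded by GTG.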