Gene EcSMS35_1501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1501 
Symbol 
ID6146670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1484513 
End bp1485667 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content51% 
IMG OID641616379 
Productputative acyl-CoA dehydrogenase 
Protein accessionYP_001743559 
Protein GI170684026 
COG category[I] Lipid transport and metabolism 
COG ID[COG1960] Acyl-CoA dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0210737 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGATT TTTCTTTAAC TGAAGAACAA GAACTGCTGC TGGCCAGTAT TCGCGAACTG 
ATTACGACTA ACTTTCCGGA AGAGTATTTC CGCACCTGCG ATCAAAACGG GACATACCCG
CGTGAGTTTA TGCGGGCACT GGCGGATAAC GGTATTTCCA TGCTTGGCGT GCCGGAAGAA
TTTGGTGGTA TCCCTGCGGA TTACGTCACC CAAATGCTGG CGCTGATGGA AGTGTCAAAA
TGCGGTGCTC CGGCATTTTT GATTACCAAC GGTCAATGTA TTCACAGTAT GCGCCGTTTC
GGTTCTGCAG AGCAGCTACG TAAAACGGCA GAGAGCACAC TGGAAACGGG TGATCCCGCC
TATGCCCTGG CATTGACGGA GCCAGGCGCA GGCTCAGACA ACAACAGTGC CACTACCACT
TACACGCGTA AAAACGGCAA GGTTTACATC AACGGACAGA AAACCTTTAT TACCGGTGCG
AAAGAGTATC CATATATGCT GGTGTTGGCG CGCGATCCGC AACCGAAAGA TCCGAAAAAG
GCCTTTACTC TGTGGTGGGT CGACTCCAGT AAACCCGGCA TTAAAATCAA TCCACTGCAT
AAAATCGGCT GGCATATGCT CAGCACCTGC GAAGTCTATC TCGACAACGT GGAAGTTGAA
GAGAGCGACA TGGTGGGCGA AGAAGGAATG GGTTTCCTCA ATGTGATGTA CAACTTTGAG
ATGGAGCGCC TGATCAACGC CGCGCGCAGC ACCGGCTTTG CCGAATGCGC ATTCGAAGAT
GCCGCCCGCT ATGCCAACCA GCGTATCGCT TTTGGTAAGC CCATTGGTCA TAACCAGATG
ATCCAGGAAA AACTGGCGCT GATGGCGATT AAGATCGACA ACATGCGCAA CATGGTGTTG
AAAGTGGCAT GGCAAGCCGA TCAGCATCAG TCACTGCGCA CCAGCGCGGC GCTGGCAAAA
CTGTACTGCG CACGTACCGC AATGGAAGTC ATTGATGATG CGATTCAAAT CATGGGCGGT
CTGGGCTATA CCGATGAGGC GCGCGTGTCC CGCTTCTGGC GTGATGTCCG TTGTGAACGT
ATCGGCGGCG GTACAGACGA AATTATGATT TACGTAGCGG GTCGGCAGAT CCTGAAAGAT
TATCAGAACA AATAA
 
Protein sequence
MMDFSLTEEQ ELLLASIREL ITTNFPEEYF RTCDQNGTYP REFMRALADN GISMLGVPEE 
FGGIPADYVT QMLALMEVSK CGAPAFLITN GQCIHSMRRF GSAEQLRKTA ESTLETGDPA
YALALTEPGA GSDNNSATTT YTRKNGKVYI NGQKTFITGA KEYPYMLVLA RDPQPKDPKK
AFTLWWVDSS KPGIKINPLH KIGWHMLSTC EVYLDNVEVE ESDMVGEEGM GFLNVMYNFE
MERLINAARS TGFAECAFED AARYANQRIA FGKPIGHNQM IQEKLALMAI KIDNMRNMVL
KVAWQADQHQ SLRTSAALAK LYCARTAMEV IDDAIQIMGG LGYTDEARVS RFWRDVRCER
IGGGTDEIMI YVAGRQILKD YQNK