Gene EcSMS35_2521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2521 
Symbol 
ID6147487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2578843 
End bp2579988 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content45% 
IMG OID641617393 
Producthypothetical protein 
Protein accessionYP_001744564 
Protein GI170680633 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAATA ATGAAAGCAA AGGGCCGTTT GAAGGCTTAT TAGTTATCGA TATGACGCAT 
GTCCTTAATG GGCCTTTCGG AACTCAACTT CTTTGTAATA TGGGCGCAAG GGTAATTAAA
GTTGAGCCGC CAGGTCATGG TGATGATACC CGCACATTTG GTCCCTATGT GGATGGACAG
TCACTCTATT ACAGTTTTAT TAATCATGGC AAAGAGAGTG TGGTTCTTGA TTTAAAGAAT
GATCACGATA AAAGTATATT TATAAATATG CTTAAACAAG CTGATGTATT AGCTGAGAAT
TTTCGCCCAG GTACAATGGA AAAACTGGGG TTTTCATGGG AAAGGTTACA AGAAATCAAC
CCGCGCCTTA TATATGCTTC ATCGTCAGGT TTCGGACATA CCGGTCCGCT AAAAGATGCT
CCTGCCTACG ATACCATCAT TCAGGCAATG AGCGGGATAA TGATGGAAAC AGGATACCCT
GATGCTCCGC CAGTGCGCGT TGGTACCTCT CTTGCGGATC TATGTGGTGG TGTTTATTTA
TTCAGCGGAA TAGTGAGTGC ACTTTATGGC CGCGAAAAGA GCCAGAGAGG TGCGCATGTC
GATATAGCGA TGTTTGATGC CACGCTGAGT TTTCTGGAGC ATGGACTGAT GGCATATATC
GCGACTGGGA AGTCACCACA ACGCCTGGGA AATCGCCATC CCTACATGGC ACCTTTTGAT
GTTTTTGATA CTCAGGATAA GCCGATTACA ATTTGTTGTG GTAATGACAA GCTTTTTTCT
GCGTTATGCC AGGCACTGGA GCTTACAGAA CTGGTTAATG ATCCCCGATT TAGCAGCAAT
ATTTTACGCG TACAAAACCA GGCTATTCTT AAACAATATA TTGAGCGAAC GTTAAAAACG
CAGGCAGCTG AAGTTTGGTT AGCCAGAATA CATGAAGTTG GTGTACCCGT CGCGCCGTTA
TTAAGTGTGG CTGAGGCCAT TAATTTGCCA CAAACTCAGG CGAGAAATAT GTTGATTGAA
GCCGGAGGAA TAATGATGCC AGGTAATCCG ATAAAAATCA GCGGCTGCGC GGACCCGCAT
GTTATGCCGG GAGCGGCAAC GCTCGACCAG CATGGGGAAC AAATTCGCCA GGAGTTCTCA
TCATAA
 
Protein sequence
MTNNESKGPF EGLLVIDMTH VLNGPFGTQL LCNMGARVIK VEPPGHGDDT RTFGPYVDGQ 
SLYYSFINHG KESVVLDLKN DHDKSIFINM LKQADVLAEN FRPGTMEKLG FSWERLQEIN
PRLIYASSSG FGHTGPLKDA PAYDTIIQAM SGIMMETGYP DAPPVRVGTS LADLCGGVYL
FSGIVSALYG REKSQRGAHV DIAMFDATLS FLEHGLMAYI ATGKSPQRLG NRHPYMAPFD
VFDTQDKPIT ICCGNDKLFS ALCQALELTE LVNDPRFSSN ILRVQNQAIL KQYIERTLKT
QAAEVWLARI HEVGVPVAPL LSVAEAINLP QTQARNMLIE AGGIMMPGNP IKISGCADPH
VMPGAATLDQ HGEQIRQEFS S