Gene EcSMS35_2011 details


Gene Information

Locus tag: EcSMS35_2011
Symbol:
ID: 6143108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2031344
End bp: 2032417
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 49%
IMG OID: 641616887
Product: putative acyltransferase
Protein accession: YP_001744063
Protein GI: 170683490
COG category: [S] Function unknown
COG ID: [COG4763] Predicted membrane protein
TIGRFAM ID:
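
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length_bp(2031344, 2032417)
length_aa = protein_length_aa(length_bp)
print(length_bp, "bp")  # 1074 bp
print(length_aa, "aa")  # 357 aa
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields.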


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.020428
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAAA AAGAGCTATG GATTAACCAG ATCAAAGGGT TATGTATTTG TCTGGTAGTG 
ATTTATCACT CGGTGATTAC CTTTTATCCG CATCTGAGCA CTTTTCAGCA TCCGTTATCG
GAAGTCCTGA GCAAATGCTG GATCTATTTC AATCTTTACC TTGCCCCCTT TCGTATGCCG
GTTTTTTTCT TTATCTCTGG CTATTTAATT CGCCGCTATA TCGACAGCGT GCCATGGGGA
AATTGTCTTG ATAAACGCAT CTGGAGCATC TTCTGGGTGC TGGCACTTTG GGGCGTAGTG
CAGTGGCTGG CGATAAGTGC ACTAAATCAG TGGCTGGCAC CTGAGCGCAA TTTAAGTAAT
GCCTCCAATG CCGCTTATGC CGATTCTACC GGTGAGTTCC TGCACGGGAT GATCACCGCC
AGCACCAGCT TGTGGTATCT GTATGCTTTA ATTGTCTATT TCGTGATATG TAAAATTTTT
AACCGCCTGG CGCTGCCACT ATTCGCCTTG TTTGTACTGC TGAGTGTGGC GGTTAATTTC
GTTCCCACGC CGTGGTGGGG AATGAACAGT GTGATCCGCA ATTTGCCTTA TTACAGCCTT
GGCGCATGGT TTGGCGCAAC AATAATGACC TGTGTTAAAG AGGTGCCGTT GCGCCGCCAT
CTGCTGATGG CTTCTTTGCT GACCGTTCTG GCGGTCGGTG CCTGGTTGTT TACTATCTCG
CTGCTGTTGT CGCTGGTATC GATTGTGGTA ATCATGAAGC TGTTTTATCA GTACGAGCAG
CGTTTCGGTA TGCGTTCCTC CAGCCTGCTG AATGTGATTG GTTCCAACAC CATTGCTATC
TACACCACCC ATCGCATTCT GGTTGAAATA TTCAGCTTAA CTCTGCTTGC GCAAATGAAC
GCAGCACGCT GGTCGCCGCA AGTCGAACTG ACACTCCTGC TGGTTTACCC CTTTGTTAGT
TTGTTCATCT GTACTGTTGC GGGCTTGCTG GTAAGAAAAC TTTCACAGCG CGCATTCAGC
GATCTGTTGT TCTCCCCGCC TTCTCTGCCC GCGGCCGTCA GTTACTCCCG CTAA
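
The gene sequence is displayed in space-separated blocks of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage like the one in the GC content field. It is run only on the first displayed line of the sequence, so its result is not expected to match the whole-gene figure of 49%. The helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence only (60 bases).
first_line = "ATGAAACAAA AAGAGCTATG GATTAACCAG ATCAAAGGGT TATGTATTTG TCTGGTAGTG"
seq = clean_sequence(first_line)
print(len(seq), "bases")
print(f"GC content of this fragment: {100 * gc_fraction(seq):.0f}%")
```

Passing the full gene sequence to `clean_sequence` and then `gc_fraction` gives the corresponding value for the whole gene.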
 
Protein sequence
MKQKELWINQ IKGLCICLVV IYHSVITFYP HLSTFQHPLS EVLSKCWIYF NLYLAPFRMP 
VFFFISGYLI RRYIDSVPWG NCLDKRIWSI FWVLALWGVV QWLAISALNQ WLAPERNLSN
ASNAAYADST GEFLHGMITA STSLWYLYAL IVYFVICKIF NRLALPLFAL FVLLSVAVNF
VPTPWWGMNS VIRNLPYYSL GAWFGATIMT CVKEVPLRRH LLMASLLTVL AVGAWLFTIS
LLLSLVSIVV IMKLFYQYEQ RFGMRSSSLL NVIGSNTIAI YTTHRILVEI FSLTLLAQMN
AARWSPQVEL TLLLVYPFVS LFICTVAGLL VRKLSQRAFS DLLFSPPSLP AAVSYSR
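
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the method used by the source database: it builds the standard codon table, whose codon-to-amino-acid assignments are assumed here to be the same as those of translation table 11, and translates only the first displayed line of the gene sequence. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string read in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First displayed line of the gene sequence (60 bases, 20 codons).
fragment = "ATGAAACAAA AAGAGCTATG GATTAACCAG ATCAAAGGGT TATGTATTTG TCTGGTAGTG"
print(translate(fragment))  # MKQKELWINQIKGLCICLVV
```

The output matches the first 20 residues of the protein sequence shown above.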