Gene EcSMS35_2372 details


Gene Information

Locus tag: EcSMS35_2372
Symbol:
ID: 6146244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2408353
End bp: 2409675
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 53%
IMG OID: 641617245
Product: putative short chain fatty acid transporter
Protein accession: YP_001744417
Protein GI: 170682719
COG category: [I] Lipid transport and metabolism
COG ID: [COG2031] Short chain fatty acids transporter
TIGRFAM ID: [TIGR00366] conserved hypothetical integral membrane protein
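
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2408353, 2409675)
print(length)                  # 1323
print(protein_length(length))  # 440
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.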


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.608941
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGGTC GCATATCGCG TTTTATGACG CGTTTTGTCA GCCGGTGGCT TCCCGATCCA 
CTGATCTTTG CCATGTTGCT GACATTGCTA ACATTCGTGA TCGCGCTTTG GTTAACACCA
CAAACGCCGA TCAGCATGGT GAAAATGTGG GGTGACGGTT TCTGGAACTT GCTGGCGTTT
GGTATGCAGA TGGCGCTTAT CATCGTTACC GGTCATGCCC TTGCCAGCTC TGCTCCGGTA
AAAAGTTTGC TGCGTACTGC CGCCTCCGCC GCAAAGACGC CCGTACAGGG CGTCATGCTG
GTTACTTTCT TCGGTTCAGT CGCTTGTGTC ATCAACTGGG GATTTGGTTT GGTTGTCGGC
GCAATGTTTG CCCGTGAAGT CGCCCGCCGA GTACCCGGTT CTGATTATCC GTTGCTCATT
GCCTGCGCCT ACATTGGTTT TCTCACCTGG GGTGGCGGTT TCTCTGGCTC AATGCCTCTG
TTGGCTGCAA CACCGGGCAA CCCGGTTGAG CATATCGCCG GGCTGATCCC GGTGGGCGAT
ACTCTGTTCA GTGGTTTTAA CATTTTCATC ACTGTGGCGT TGATTGTGGT GATGCCATTT
ATCACCCGCA TGATGATGCC AAAACCGTCT GACGTGGTGA GTATCGATCC GAAACTACTC
ATGGAAGAGG CTGATTTCCA AAAGCAGCTA CCGAAAGATG CCCCACCATC CGAGCGACTG
GAAGAAAGCC GCATTCTGAC GTTGATCATC GGCGCACTCG GTATCGCTTA CCTTGCGATG
TACTTCAGCG AACATGGCTT CAACATCACC ATCAATACCG TCAACCTGAT GTTTATGATT
GCGGGTCTGC TGCTACATAA AACGCCAATG GCTTATATGC GTGCTATCAG CGCGGCAGCA
CGGAGTACTG CCGGTATTCT GGTGCAATTC CCCTTCTACG CTGGGATCCA ACTGATGATG
GAGCATTCCG GTCTGGGCGG ACTCATTACC GAATTCTTCA TCAATGTTGC GAACAAAGAC
ACCTTCCCGG TAATGACCTT TTTTAGTTCT GCACTGATTA ACTTCGCCGT TCCGTCTGGC
GGCGGTCACT GGGTTATTCA GGGACCTTTC GTGATACCCG CAGCCCAGGC GCTGGGCGCT
GATCTCGGTA AATCGGTAAT GGCGATCGCC TACGGCGAGC AATGGATGAA CATGGCACAA
CCGTTCTGGG CGCTGCCAGC ACTGGCAATC GCCGGACTCG GTGTCCGCGA CATCATGGGC
TATTGCATCA CTGCCCTGCT CTTCTCCGGC GTCATTTTCG TCATTGGTTT AACGCTGTTC
TGA
 
Protein sequence
MIGRISRFMT RFVSRWLPDP LIFAMLLTLL TFVIALWLTP QTPISMVKMW GDGFWNLLAF 
GMQMALIIVT GHALASSAPV KSLLRTAASA AKTPVQGVML VTFFGSVACV INWGFGLVVG
AMFAREVARR VPGSDYPLLI ACAYIGFLTW GGGFSGSMPL LAATPGNPVE HIAGLIPVGD
TLFSGFNIFI TVALIVVMPF ITRMMMPKPS DVVSIDPKLL MEEADFQKQL PKDAPPSERL
EESRILTLII GALGIAYLAM YFSEHGFNIT INTVNLMFMI AGLLLHKTPM AYMRAISAAA
RSTAGILVQF PFYAGIQLMM EHSGLGGLIT EFFINVANKD TFPVMTFFSS ALINFAVPSG
GGHWVIQGPF VIPAAQALGA DLGKSVMAIA YGEQWMNMAQ PFWALPALAI AGLGVRDIMG
YCITALLFSG VIFVIGLTLF
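
The gene and protein sequences above are printed in blocks of ten characters, so the spacing has to be stripped before they can be analysed. The following is a minimal sketch, not the method used to generate this record, of how the GC content and the translation could be recomputed. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is written out in the NCBI base order TCAG; translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from the record.
first_row = "ATGATTGGTC GCATATCGCG TTTTATGACG CGTTTTGTCA GCCGGTGGCT TCCCGATCCA"
print(translate(first_row))  # MIGRISRFMTRFVSRWLPDP
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the reported GC content and the 440-residue protein to be compared against the record in the same way.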