Gene EcSMS35_3704 details

Gene Information

Locus tag: EcSMS35_3704
Symbol:
ID: 6143117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3768154
End bp: 3768984
Gene Length: 831 bp
Protein Length: 276 aa
Translation table: 11
GC content: 54%
IMG OID: 641618531
Product: intramembrane serine protease GlpG
Protein accession: YP_001745671
Protein GI: 170683698
COG category: [R] General function prediction only
COG ID: [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
TIGRFAM ID:
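
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3768154
END_BP = 3768984
GENE_LENGTH_BP = 831
PROTEIN_LENGTH_AA = 276


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With these assumptions, 3768984 - 3768154 + 1 = 831 bp, and 831 / 3 - 1 = 276 aa, which agrees with the listed lengths.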


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.144478
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGATGA TTACCTCTTT TGCTAACCCC CGAGTGGCGC AGGCGTTTGT TGATTACATG 
GCGACGCAGG GTGTTATCCT CACGATTCAA CAACATAACC AAAGCGATGT CTGGCTAGCG
GATGAGTCCC AGGCCGAACG CGTGCGGGCG GAGCTGGCGC GTTTTCTCGA AAACCCGGCA
GATCCGCGTT ATCTGGCGGC CAGCTGGCAG GCGGGCCATA CCGGCAGTGG CCTGCATTAT
CGCCGTTATC CTTTCTTTGC TGCCTTGCGT GAACGCGCAG GTCCGGTAAC CTGGGTGATG
ATGATTGCCT GCGTGGTGGT GTTTATCGCC ATGCAAATTC TCGGCGATCA GGAAGTGATG
TTATGGCTGG CCTGGCCATT CGATCCGACG CTGAAATTTG AGTTCTGGCG TTACTTCACC
CACGCGTTAA TGCACTTCTC GCTGATGCAT ATCCTCTTTA ACCTGCTCTG GTGGTGGTAT
CTCGGCGGTG CGGTGGAAAA ACGCCTCGGT AGCGGTAAGC TAATTGTCAT TACTCTCATT
AGCGCCCTGT TAAGCGGCTA TGTGCAGCAA AAATTCAGCG GGCCGTGGTT TGGCGGGCTT
TCTGGCGTGG TGTATGCGCT GATGGGCTAC GTCTGGCTAC GTGGCGAACG CGATCCGCAA
AGTGGCATTT ACCTGCAACG TGGGTTAATT ATCTTTGCGT TGATCTGGAT TGTCGCCGGA
TGGTTTGATT TGTTTGGGAT GTCGATGGCG AACGGAGCAC ACATCGCCGG GTTAGCCGTG
GGTTTAGCGA TGGCTTTTGT TGATTCGCTC AATGCGCGAA AACGAAAATA A
 
Protein sequence
MLMITSFANP RVAQAFVDYM ATQGVILTIQ QHNQSDVWLA DESQAERVRA ELARFLENPA 
DPRYLAASWQ AGHTGSGLHY RRYPFFAALR ERAGPVTWVM MIACVVVFIA MQILGDQEVM
LWLAWPFDPT LKFEFWRYFT HALMHFSLMH ILFNLLWWWY LGGAVEKRLG SGKLIVITLI
SALLSGYVQQ KFSGPWFGGL SGVVYALMGY VWLRGERDPQ SGIYLQRGLI IFALIWIVAG
WFDLFGMSMA NGAHIAGLAV GLAMAFVDSL NARKRK
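
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before use. The following is a minimal sketch of how the GC content could be recomputed and the gene sequence translated. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; it does not handle alternative start codons, which table 11 permits. All function names are illustrative, and only the first 60 bases of the gene sequence are used as example input.

```python
import itertools

# Codon table built in TCAG order (same amino-acid assignments as table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_60 = clean(
        "ATGTTGATGA TTACCTCTTT TGCTAACCCC CGAGTGGCGC AGGCGTTTGT TGATTACATG"
    )
    print(translate(first_60))  # MLMITSFANPRVAQAFVDYM
    print(round(gc_percent(first_60)))
```

The translated prefix matches the first 20 residues of the protein sequence listed above. Applying the same functions to the full gene sequence should allow the listed GC content and protein sequence to be checked in the same way.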