Gene EcSMS35_2838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2838 
SymbolascG 
ID6146475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2913420 
End bp2914472 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content53% 
IMG OID641617707 
Producttranscriptional regulator AscG 
Protein accessionYP_001744862 
Protein GI170683113 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.106461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGTACA CTGCGTTGAT TGCGTGTAAA GGGATGAAAA TGATGACGAC GATGCTGGAA 
GTGGCGAAGC GCGCCGGGGT TTCAAAAGCG ACCGTTTCCC GCGTGCTTTC AGGTAATGGC
TACGTCAGCC AGGAGACTAA AGATCGCGTG TTTCAGGCGG TAGAAGAGAG CGGTTACCGC
CCAAATTTGC TGGCCCGCAA TCTGTCGGCG AAAAGTACCC AGACGCTGGG GCTGGTAGTG
ACTAACACGC TTTACCACGG CATTTATTTT AGTGAACTAC TGTTTCATGC CGCGCGAATG
GCGGAAGAGA AAGGGCGGCA GTTGCTATTG GCAGATGGTA AACACAGCGC CGAAGAAGAG
CGCCAGGCGA TTCAGTATCT GCTGGATCTG CGCTGCGACG CGATCATGAT TTATCCGCGC
TTTTTAAGCG TGGATGAAAT TGATGACATC ATTGACGCGC ACAGTCAGCC GATAATGGTG
CTTAATCGCC GCCTGCGCAA AAATAGCAGC CATAGCGTCT GGTGCGATCA TAAACAGACC
AGCTTTAACG CCGTGGCAGA GTTGATAAAC GCCGGGCATC AGGAGATTGC TTTCCTTACC
GGCTCGATGG ATTCCCCCAC CAGCATTGAA CGTCTTGCCG GGTATAAAGA CGCGCTGTCG
CAGCATGGTA TTGCGCTCAA TGAAAAACTT ATCGCTAACG GTAAATGGAC GCCTGCCAGC
GGGGCCGAAG GGGTAGAAAC GTTGCTCGAA CGTGGGGCTA AATTTAGCGT GTTAGTTGCC
AGTAACGACG ATATGGCGAT AGGTGCGATG AAAGCGTTGC ACGAGCGCGG CGTAGCGGTG
CCAGAGCAGG TGTCAGTTAT CGGATTCGAT GATATCGCTA TTGCCCCCTA CACCGTTCCG
GCGCTCTCCA GCGTAAAAAT TCCGGTAACT GAGATGATTC AGGAAATTAT TGGACGGCTG
ATTTTTATGC TCGATGGTGG GGATTTCTCA CCGCCGAAAA CCTTCAGCGG AAAACTGATC
CGCCGCGGCT CCCTCATTGC TCTTTCGCAA TAA
 
Protein sequence
MQYTALIACK GMKMMTTMLE VAKRAGVSKA TVSRVLSGNG YVSQETKDRV FQAVEESGYR 
PNLLARNLSA KSTQTLGLVV TNTLYHGIYF SELLFHAARM AEEKGRQLLL ADGKHSAEEE
RQAIQYLLDL RCDAIMIYPR FLSVDEIDDI IDAHSQPIMV LNRRLRKNSS HSVWCDHKQT
SFNAVAELIN AGHQEIAFLT GSMDSPTSIE RLAGYKDALS QHGIALNEKL IANGKWTPAS
GAEGVETLLE RGAKFSVLVA SNDDMAIGAM KALHERGVAV PEQVSVIGFD DIAIAPYTVP
ALSSVKIPVT EMIQEIIGRL IFMLDGGDFS PPKTFSGKLI RRGSLIALSQ