Gene EcSMS35_3652 details


Gene Information

Locus tag: EcSMS35_3652
Symbol:
ID: 6145214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3711854
End bp: 3712939
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 57%
IMG OID: 641618479
Product: hypothetical protein
Protein accession: YP_001745619
Protein GI: 170682063
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
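
The coordinate and length fields in this record agree with one another, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a gene length that includes the stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3711854, 3712939)
print(length, protein_length_aa(length))  # 1086 361
```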


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.164535
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.0000282332
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGACGT TTCCTCTGCA AAGCCTGACG CTTATTGAGG CGCAGCAAAA GCAGTTTGCG 
CTGGTGGATA GCATTTGTCG CCATTTTCCC GGCAGCGAGT TTCTTGCTGG CGGTGATTTA
GGCTTAACGC CGGGGCTGAA TCAACCGCGC ATTACCCAGC GAGTGGAACA GGTGCTGGCT
GATGCATTTC ACGCACAGGC TGCAGCGCTG GTACAGGGCG CGGGTACTGG CGCGATCCGT
GCGGCGTTGG CGGCTTTGCT CAAACCGGGG CAGCGTCTTC TGGTGCATGA CGCGCCTGTT
TACCCGACGA CTCAGGTCAT TATTGAGCAG ATGGGGCTGA CGCTTATTAC TGCTAATTTC
AATGACCTGT CGGCACTTAA GCAGGTCGTC GACGAGCAAC AACCGGATGC GGCGCTGGTG
CAGCATACGC GCCAGCAGCC GCAGGACGGC TACATTCTGG CAGATGTGCT GGCAACGTTG
CGCTCGGCAG GTGTTCCAGC CTTAACCGAT GATAACTATG CGGTGATGAA GGTGGCGCGC
ATTGGCTGTG AATGCGGGGC GAATGTCTCG ACATTTTCCT GCTTCAAGCT ATTTGGGCCA
GAGGGTGTTG GCGCGGTGGT AGGCGATGCC GATGTTATTA GCCGGATTCG CGCCACGCTC
TATTCCGGCG GCAGCCAGGT TCAGGGCGCG CAGGCGCTGG AAGTCTTGCG TGGTCTGGTG
CTCGCGCCAG TGATGCACGC GGTGCAGGCG GGGGTATCTG AACGGTTGCT GGCTTTGCTT
AACGGCGGCG CGGTGGCGGA AGTGAAAAGC GCCGTCATTG CCAATGCACA GTCGAAGGTA
TTGATTGTGG AGTTTCATCA GCCGATTGCC GCCAGAGTGC TGGAAGAGGC GCAGAAACTC
GGTGCCTTGC CTTACCCGGT TGGTGCAGAG TCGAAATATG AAATCCCGCC GCTCTTTTAT
CGACTTTCCG GAACGTTTCG CCAGGCGAAT CCGCAGTTAG AACATTGCGC GATTCGCATT
AACCCAAATC GCAGCGGTGA AGAGACGGTA TTGCGGATTT TGCGCGAGAG TATTGCCGAC
GTTTAA
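
A GC content figure like the one in the record can be recomputed from the gene sequence. The sketch below is a minimal illustration, not the database's own method: `gc_percent` is a hypothetical helper that ignores the spaces used to group bases, and it is applied here only to the first 10-base block rather than the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above.
print(gc_percent("ATGAAGACGT"))  # 40.0
```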
 
Protein sequence
MKTFPLQSLT LIEAQQKQFA LVDSICRHFP GSEFLAGGDL GLTPGLNQPR ITQRVEQVLA 
DAFHAQAAAL VQGAGTGAIR AALAALLKPG QRLLVHDAPV YPTTQVIIEQ MGLTLITANF
NDLSALKQVV DEQQPDAALV QHTRQQPQDG YILADVLATL RSAGVPALTD DNYAVMKVAR
IGCECGANVS TFSCFKLFGP EGVGAVVGDA DVISRIRATL YSGGSQVQGA QALEVLRGLV
LAPVMHAVQA GVSERLLALL NGGAVAEVKS AVIANAQSKV LIVEFHQPIA ARVLEEAQKL
GALPYPVGAE SKYEIPPLFY RLSGTFRQAN PQLEHCAIRI NPNRSGEETV LRILRESIAD
V
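
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is hedged accordingly: it uses the standard sense-codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the grouping spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAAGACGT TTCCTCTGCA AAGCCTGACG CTTATTGAGG CGCAGCAAAA GCAGTTTGCG"
print(translate(first_line))  # MKTFPLQSLTLIEAQQKQFA
```

The printed residues match the first two blocks of the protein sequence listed above.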