Gene EcSMS35_1856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1856 
SymbolcysB 
ID6144449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1878081 
End bp1879055 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content49% 
IMG OID641616732 
Producttranscriptional regulator CysB 
Protein accessionYP_001743910 
Protein GI170679623 
COG category[K] Transcription 
COG ID[COG0583] Transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.314733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.57931e-19 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATTAC AACAACTTCG CTATATTGTT GAGGTGGTCA ATCATAACCT GAATGTCTCA 
TCAACGGCGG AAGGACTTTA CACATCACAA CCCGGGATCA GTAAACAAGT CAGAATGCTG
GAAGACGAGC TAGGCATTCA AATTTTTTCC CGAAGCGGCA AGCACCTGAC GCAGGTAACG
CCAGCAGGAC AAGAAATAAT TCGTATCGCT CGCGAAGTCC TGTCGAAAGT CGATGCCATA
AAATCGGTCG CCGGAGAGCA CACCTGGCCG GATAAAGGCT CGCTGTATAT CGCCACCACG
CATACCCAGG CACGCTACGC TTTACCAAAC GTTATCAAAG GCTTTATTGA GCGTTATCCT
CGCGTTTCTT TGCATATGCA CCAGGGCTCG CCGACACAAA TTGCTGATGC CGTCTCTAAA
GGCAATGCCG ATTTCGCGAT TGCCACAGAA GCGCTGCATC TGTATGAAGA TTTAGTGATG
TTACCGTGCT ACCACTGGAA TCGGGCTATT GTAGTCACTC CGGATCACCC GCTGGCAGGC
AAAAAAGCCA TTACCATTGA AGAACTGGCG CAATATCCGT TGGTGACATA TACCTTCGGC
TTTACCGGAC GCTCAGAACT GGATACTGCC TTTAACCGCG CAGGGTTAAC GCCGCGTATC
GTCTTCACGG CAACGGATGC TGACGTCATT AAAACTTACG TCCGGTTAGG GTTGGGGGTA
GGGGTTATTG CCAGCATGGC GGTGGATCCG GTCGCCGATC CCGACCTGGT GCGCGTTGAT
GCTCACGATA TCTTCAGCCA CAGTACAACC AAAATTGGTT TTCGCCGTAG TACTTTTTTG
CGCAGTTATA TGTATGATTT CATTCAGCGT TTTGCACCGC ATTTAACGCG TGATGTCGTT
GATGCGGCTG TCGCATTGCG CTCTAATGAA GAAATTGAGG CCATGTTTAA AGATATAAAA
CTGCCGGAAA AATAA
 
Protein sequence
MKLQQLRYIV EVVNHNLNVS STAEGLYTSQ PGISKQVRML EDELGIQIFS RSGKHLTQVT 
PAGQEIIRIA REVLSKVDAI KSVAGEHTWP DKGSLYIATT HTQARYALPN VIKGFIERYP
RVSLHMHQGS PTQIADAVSK GNADFAIATE ALHLYEDLVM LPCYHWNRAI VVTPDHPLAG
KKAITIEELA QYPLVTYTFG FTGRSELDTA FNRAGLTPRI VFTATDADVI KTYVRLGLGV
GVIASMAVDP VADPDLVRVD AHDIFSHSTT KIGFRRSTFL RSYMYDFIQR FAPHLTRDVV
DAAVALRSNE EIEAMFKDIK LPEK