Gene Information |
Locus tag | EcSMS35_1856 |
Symbol | cysB |
ID | 6144449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1878081 |
End bp | 1879055 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641616732 |
Product | transcriptional regulator CysB |
Protein accession | YP_001743910 |
Protein GI | 170679623 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
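
The coordinates and lengths listed above agree with one another if Start bp and End bp are read as 1-based, inclusive positions. The record does not state the convention, so that is an assumption. A minimal Python sketch of the arithmetic, with hypothetical variable names:

```python
# Values copied from the Gene Information table.
start_bp = 1878081
end_bp = 1879055

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon,
# which does not contribute an amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 975 0 324
```

Under that assumption the result matches the listed Gene Length (975 bp) and Protein Length (324 aa).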
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.314733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 2.57931e-19 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAATTAC AACAACTTCG CTATATTGTT GAGGTGGTCA ATCATAACCT GAATGTCTCA TCAACGGCGG AAGGACTTTA CACATCACAA CCCGGGATCA GTAAACAAGT CAGAATGCTG GAAGACGAGC TAGGCATTCA AATTTTTTCC CGAAGCGGCA AGCACCTGAC GCAGGTAACG CCAGCAGGAC AAGAAATAAT TCGTATCGCT CGCGAAGTCC TGTCGAAAGT CGATGCCATA AAATCGGTCG CCGGAGAGCA CACCTGGCCG GATAAAGGCT CGCTGTATAT CGCCACCACG CATACCCAGG CACGCTACGC TTTACCAAAC GTTATCAAAG GCTTTATTGA GCGTTATCCT CGCGTTTCTT TGCATATGCA CCAGGGCTCG CCGACACAAA TTGCTGATGC CGTCTCTAAA GGCAATGCCG ATTTCGCGAT TGCCACAGAA GCGCTGCATC TGTATGAAGA TTTAGTGATG TTACCGTGCT ACCACTGGAA TCGGGCTATT GTAGTCACTC CGGATCACCC GCTGGCAGGC AAAAAAGCCA TTACCATTGA AGAACTGGCG CAATATCCGT TGGTGACATA TACCTTCGGC TTTACCGGAC GCTCAGAACT GGATACTGCC TTTAACCGCG CAGGGTTAAC GCCGCGTATC GTCTTCACGG CAACGGATGC TGACGTCATT AAAACTTACG TCCGGTTAGG GTTGGGGGTA GGGGTTATTG CCAGCATGGC GGTGGATCCG GTCGCCGATC CCGACCTGGT GCGCGTTGAT GCTCACGATA TCTTCAGCCA CAGTACAACC AAAATTGGTT TTCGCCGTAG TACTTTTTTG CGCAGTTATA TGTATGATTT CATTCAGCGT TTTGCACCGC ATTTAACGCG TGATGTCGTT GATGCGGCTG TCGCATTGCG CTCTAATGAA GAAATTGAGG CCATGTTTAA AGATATAAAA CTGCCGGAAA AATAA
|
Protein sequence | MKLQQLRYIV EVVNHNLNVS STAEGLYTSQ PGISKQVRML EDELGIQIFS RSGKHLTQVT PAGQEIIRIA REVLSKVDAI KSVAGEHTWP DKGSLYIATT HTQARYALPN VIKGFIERYP RVSLHMHQGS PTQIADAVSK GNADFAIATE ALHLYEDLVM LPCYHWNRAI VVTPDHPLAG KKAITIEELA QYPLVTYTFG FTGRSELDTA FNRAGLTPRI VFTATDADVI KTYVRLGLGV GVIASMAVDP VADPDLVRVD AHDIFSHSTT KIGFRRSTFL RSYMYDFIQR FAPHLTRDVV DAAVALRSNE EIEAMFKDIK LPEK
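
The two sequence fields can be cross-checked by translating the gene sequence. The sketch below is illustrative only: the helper names are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons). Although the gene lies on the minus strand, the gene sequence as listed already begins with ATG, so it is assumed to be given in coding orientation and is not reverse-complemented here. Only the first 30 and last 15 bases are used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence, copied from the record.
gene_prefix = "ATGAAATTAC AACAACTTCG CTATATTGTT"
gene_suffix = "CTGCCGGAAA AATAA"

print(translate(gene_prefix))  # MKLQQLRYIV
print(translate(gene_suffix))  # LPEK
```

Both fragments match the start (MKLQQLRYIV) and end (LPEK) of the listed protein sequence. Passing the full gene sequence to the same function would be the natural way to check the whole record.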
|
| |