Gene Information |
Locus tag | B21_01639 |
Symbol | sufS |
ID | 8115176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 1703886 |
End bp | 1705106 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644847865 |
Product | hypothetical protein |
Protein accession | YP_002999438 |
Protein GI | 251785134 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
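
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; neither assumption is stated in the record itself, and the function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1703886, 1705106)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Both results match the Gene Length (1221 bp) and Protein Length (406 aa) fields listed in the record.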
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000164024 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACTTTTT CCGTCGACAA AGTGCGGGCC GACTTTCCGG TGCTTTCTCG TGAGGTAAAC GGTTTGCCGC TGGCTTATCT CGACAGCGCC GCCAGTGCGC AGAAACCGGG CCAGGTGATT GACGCCGAGG CCGAGTTTTA TCGTCACGGC TACGCGGCGG TGCATCGCGG TATTCATACC TTAAGCGCCC AGGCGACCGA GAAAATGGAG AACGTACGCA AGCAGGCATC GTTGTTTATC AACGCCCGTT CGGCGGAAGA GCTGGTGTTC GTCCGCGGCA CGACGGAAGG GATCAATCTG GTCGCCAATA GCTGGGGCAA CAGCAACGTG CGGGCGGGCG ATAACATCAT CATAAGTCAG ATGGAGCACC ACGCTAACAT TGTTCCCTGG CAGATGCTTT GCGCACGCGT TGGCGCAGAG CTGCGTGTGA TCCCGCTCAA TCCCGACGGT ACGTTGCAAC TGGAGACGCT GCCTACGCTG TTTGATGAGA AAACTCGCCT GCTGGCAATT ACTCATGTCT CCAACGTGCT TGGCACAGAA AATCCACTGG CGGAAATGAT CACGCTTGCG CACCAGCATG GCGCAAAAGT GCTGGTGGAT GGCGCTCAGG CGGTGATGCA TCATCTGGTG GATGTTCAGG CGCTGGATTG CGACTTTTAC GTGTTCTCCG GGCATAAACT GTATGGCCCC ACCGGAATTG GCATTCTTTA TGTCAAAGAA GCCTTGTTGC AGGAGATGCC GCCGTGGGAA GGGGGCGGTT CTATGATCGC CACCGTCAGC CTGAGTGAAG GCACTACCTG GACCAAAGCA CCATGGCGGT TTGAAGCCGG TACACCCAAT ACCGGGGGCA TCATTGGTCT TGGCGCGGCG CTGGAGTATG TTTCGGCGCT GGGGCTTAAT AACATAGCCG AGTATGAACA GAATCTGATG CATTATGCGC TATCACAGCT GGAATCTGTA CCGGATCTCA CTCTCTATGG CCCACAAAAC AGGCTTGGCG TTATTGCTTT TAATCTCGGT AAACACCACG CCTATGATGT TGGCAGTTTT CTCGATAATT ACGGCATTGC TGTGCGTACC GGACATCACT GCGCAATGCC ATTGATGGCC TATTACAACG TCCCTGCGAT GTGTCGGGCG TCGCTGGCCA TGTATAACAC CCATGAAGAA GTGGATCGTC TGGTGACCGG CCTGCAACGT ATTCACCGTT TGCTGGGATA A
Protein sequence | MTFSVDKVRA DFPVLSREVN GLPLAYLDSA ASAQKPGQVI DAEAEFYRHG YAAVHRGIHT LSAQATEKME NVRKQASLFI NARSAEELVF VRGTTEGINL VANSWGNSNV RAGDNIIISQ MEHHANIVPW QMLCARVGAE LRVIPLNPDG TLQLETLPTL FDEKTRLLAI THVSNVLGTE NPLAEMITLA HQHGAKVLVD GAQAVMHHLV DVQALDCDFY VFSGHKLYGP TGIGILYVKE ALLQEMPPWE GGGSMIATVS LSEGTTWTKA PWRFEAGTPN TGGIIGLGAA LEYVSALGLN NIAEYEQNLM HYALSQLESV PDLTLYGPQN RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT GHHCAMPLMA YYNVPAMCRA SLAMYNTHEE VDRLVTGLQR IHRLLG
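
The GC content field is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted in the 10-base, space-separated blocks used above; `gc_content` is a hypothetical helper name, and the short fragment is just the first two blocks of the gene sequence, not the full gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence shown above.
fragment = "ATGACTTTTT CCGTCGACAA"
print(f"{gc_content(fragment):.0%}")  # 40%
```

Passing the complete gene sequence to the same function gives the gene-wide value to compare against the GC content field in the Gene Information table.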