Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_1962 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 2117955 |
End bp | 2119175 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | ACX39619 |
Protein GI | 260449197 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00108569 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTTTT CCGTCGACAA AGTGCGGGCC GACTTTCCGG TGCTTTCGCG TGAGGTAAAC GGTTTGCCGC TGGCTTATCT CGACAGCGCC GCCAGTGCGC AGAAACCGAG CCAGGTGATT GACGCCGAGG CCGAGTTTTA TCGTCATGGC TACGCGGCGG TGCATCGTGG TATTCATACC TTAAGCGCCC AGGCGACCGA GAAAATGGAG AACGTGCGCA AGCGGGCATC GCTGTTTATT AATGCCCGTT CGGCGGAAGA GCTGGTGTTC GTCCGCGGCA CGACGGAAGG GATCAATCTG GTCGCCAATA GCTGGGGCAA CAGCAACGTG CGGGCGGGCG ATAACATCAT CATCAGTCAG ATGGAGCACC ACGCTAACAT TGTTCCCTGG CAGATGCTTT GCGCACGCGT TGGCGCAGAG CTGCGTGTGA TCCCGCTCAA TCCCGATGGT ACGTTGCAAC TGGAGACGCT GCCTACGCTG TTTGATGAGA AAACTCGCCT GCTGGCAATT ACTCATGTCT CCAACGTGCT TGGCACAGAA AATCCACTGG CGGAAATGAT CACGCTTGCG CACCAGCATG GCGCAAAAGT GCTGGTGGAT GGCGCTCAGG CGGTGATGCA TCATCCGGTG GATGTTCAGG CGCTGGATTG CGACTTTTAC GTGTTCTCCG GGCATAAACT GTATGGCCCC ACCGGAATTG GCATTCTTTA TGTGAAAGAA GCCTTGTTGC AGGAGATGCC GCCGTGGGAA GGGGGCGGTT CTATGATCGC CACCGTCAGC CTGAGTGAAG GCACTACCTG GACCAAAGCA CCATGGCGGT TTGAAGCCGG TACACCCAAT ACCGGGGGCA TCATTGGTCT TGGCGCGGCG CTGGAGTATG TTTCGGCGCT GGGGCTTAAT AACATAGCCG AGTATGAACA GAATCTGATG CATTATGCGC TATCACAGCT GGAATCTGTA CCGGATCTCA CTCTCTATGG CCCACAAAAC AGGCTTGGCG TTATTGCTTT TAATCTCGGT AAACACCACG CCTATGATGT TGGCAGTTTT CTCGATAATT ACGGCATTGC TGTGCGTACC GGACATCACT GCGCAATGCC ATTGATGGCC TATTACAACG TCCCTGCGAT GTGTCGGGCG TCGCTGGCCA TGTATAACAC CCATGAAGAA GTGGATCGTC TGGTGACCGG CCTGCAACGT ATTCACCGTT TGCTGGGATA A
|
Protein sequence | MIFSVDKVRA DFPVLSREVN GLPLAYLDSA ASAQKPSQVI DAEAEFYRHG YAAVHRGIHT LSAQATEKME NVRKRASLFI NARSAEELVF VRGTTEGINL VANSWGNSNV RAGDNIIISQ MEHHANIVPW QMLCARVGAE LRVIPLNPDG TLQLETLPTL FDEKTRLLAI THVSNVLGTE NPLAEMITLA HQHGAKVLVD GAQAVMHHPV DVQALDCDFY VFSGHKLYGP TGIGILYVKE ALLQEMPPWE GGGSMIATVS LSEGTTWTKA PWRFEAGTPN TGGIIGLGAA LEYVSALGLN NIAEYEQNLM HYALSQLESV PDLTLYGPQN RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT GHHCAMPLMA YYNVPAMCRA SLAMYNTHEE VDRLVTGLQR IHRLLG
|
| |