Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_1764 |
Symbol | |
ID | 5113285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 1917806 |
End bp | 1919026 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640491953 |
Product | bifunctional cysteine desulfurase/selenocysteine lyase |
Protein accession | YP_001176494 |
Protein GI | 146311420 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000868369 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000289207 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTTTCC CGGTAGAGAA AGTGCGGGCG GATTTCCCTG TCCTGACCCG TGAAGTCAAC GGTTTGCCGC TTGCCTATCT CGACAGCGCC GCCAGCGCGC AAAAGCCGAA TCAGGTGATT GACGCCGAGA TGGAATTTTA CCGCCACGGC TATGCGGCCG TGCATCGCGG CATTCATACC CTGAGCGCAG AAGCCACTCA GCGCATGGAA AATGTCCGCA CGCAGGTAGC GGCATTCCTG AACGCCCGTT CAGCGGAAGA GCTGGTGTTT GTGCGCGGCA CAACAGAGGG GATCAACCTG GTCGCCAATA GCTGGGGCAA TGCGCAGGTG CATGCGGGCG ATAATATCGT GATCACCCAG ATGGAGCACC ACGCCAATAT CGTGCCGTGG CAGATGCTCT GTGAGCGCTC AGGCGCACAG CTGCGCGTCA TTCCACTTAA CGTGGACGGC ACGTTGCAGC TGGAACAGCT CGACGCGTTG CTTGACGCGC GTACGCGACT GGTGGCGATT ACGCAGATCT CTAACGTTCT TGGCACCGCG AATCCGGTCG CAGAAATCAT TGCGAAAGCG CATCAGGCTG GCGCAAAAGT GCTGGTGGAT GGCGCACAGG CCGTTATGCA TCACACTATT GACGTGCAGG CGCTGGACTG TGATTTTTAC GTGTTTTCCG GTCACAAGCT GTATGGCCCA ACCGGAATCG GTGTGCTGTA TGTGAAAGAA GATATTTTGC AGGCGATGCC GCCGTGGGAA GGGGGCGGAT CGATGATTGC GACCGTCAGC CTGACGCAAG GCACGACCTA CGCCAAAGCC CCGTGGCGCT TTGAAGCGGG TACACCGAAT ACGGGCGGGA TCATCGGGCT GGGTGCGGCA ATCGACTACG TTTCCACACT CGGTTTGGAT GCTATCGCCG AGTATGAAGC GTCGCTGATG CGCTATGCGC TGGCGGAAAT GGCCAGCGTC CCGGATCTCA CGCTGTACGG CCCTGACGCG CGTAAAGGCG TTATTGCCTT TAATCTGGGC AAACATCACG CTTACGACGT GGGCAGTTTC CTTGATAATT ATGGCGTGGC GGTACGAACG GGTCACCACT GCGCAATGCC GCTGATGGCG TTTTACCAGG TCCCGGCAAT GTGCCGCGCG TCGCTGGTGA TGTATAACAC GACGGAAGAG GTCGACAGGC TGGTGACGGG GCTCAAACGC ATCCATCATC TCCTGGGATA A
|
Protein sequence | MSFPVEKVRA DFPVLTREVN GLPLAYLDSA ASAQKPNQVI DAEMEFYRHG YAAVHRGIHT LSAEATQRME NVRTQVAAFL NARSAEELVF VRGTTEGINL VANSWGNAQV HAGDNIVITQ MEHHANIVPW QMLCERSGAQ LRVIPLNVDG TLQLEQLDAL LDARTRLVAI TQISNVLGTA NPVAEIIAKA HQAGAKVLVD GAQAVMHHTI DVQALDCDFY VFSGHKLYGP TGIGVLYVKE DILQAMPPWE GGGSMIATVS LTQGTTYAKA PWRFEAGTPN TGGIIGLGAA IDYVSTLGLD AIAEYEASLM RYALAEMASV PDLTLYGPDA RKGVIAFNLG KHHAYDVGSF LDNYGVAVRT GHHCAMPLMA FYQVPAMCRA SLVMYNTTEE VDRLVTGLKR IHHLLG
|
| |