Gene Information |
Locus tag | Ent638_4011 |
Symbol | |
ID | 5110476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 4350409 |
End bp | 4351956 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640494229 |
Product | threonine dehydratase |
Protein accession | YP_001178717 |
Protein GI | 146313643 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01124] threonine ammonia-lyase, biosynthetic, long form |
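The length fields above are consistent with the coordinates. As a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in Protein Length, the arithmetic can be checked as follows. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4350409, 4351956)
residues = protein_length_aa(length)
print(length, residues)  # 1548 515
```

Both values match the Gene Length and Protein Length fields of the record.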
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0184686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGCCG AATCACAACC CTTATCCGCC GCCCCCGATG GGGCGGAATA TCTCCGGGCG GTGCTACGTG CGCCAGTCTA CGAGGCAGTG CAAGTCACAC CGCTGCAAAA AATGGACAAG CTCTCGTCAC GTCTCGATAA TGTGATTCTG GTGAAGCGCG AAGACCGTCA GCCTGTGCAC AGCTTCAAGC TGCGCGGCGC TTATACGATG ATGGCCGGGC TGAACGAGGA GCAAAAAGCA CGCGGTGTAA TTACGGCTTC TGCGGGCAAC CATGCGCAGG GCGTGGCATT CTCTGCCGCG CGTCTGGGCG TGAAAGCGCT CATCGTGATG CCCGTAGCAA CCGCTGATAT CAAAGTCGAT GCGGTGCGCG GCTTTGGTGG CGAAGTGTTA TTGCACGGCG CCAACTTTGA CGAAGCCAAA GCGAAAGCGA TCGACCTCGC GCAGCAACAG GGTTTTACCT GGGTACCGCC GTTTGATCAT CCGATGGTCA TTGCCGGCCA GGGAACGCTA GCCCTGGAGC TGTTGCAGCA GGATGCGCAC CTCGATCGGA TATTCGTCCC GGTCGGCGGC GGCGGCCTTG CCGCAGGCGT CGCGGTGCTC ATCAAGCAGT TAATGCCGCA AATCAAAGTC ATCGCCGTTG AGGCGGAAGA CTCTGCGTGC CTGAAAGCGG CATTGGATGC GGGTCATCCT GTCGATCTGG CGCGGGTGGG GTTGTTTGCC GAAGGCGTTG CCGTGAAGCG TATTGGCGAT GAAACCTTCC GCGTGTGTCA GGAATATCTC GACGACATCA TCACCGTCGA CAGCGATTCT ATTTGCGCCG CAATGAAAGA TCTGTTCGAA GATGTTCGTG CGGTCGCGGA ACCGTCCGGT GCGCTGGCGC TGGCGGGGAT GAAGAAATAC ATCGCGCAGC ACAACATTCG CGGCGAACGT CTGGCGCATG TGTTGTCTGG CGCTAACGTC AATTTCCACG GCTTGCGTTA CGTTTCCGAA CGCTGCGAGC TGGGCGAACA GCGTGAAGCG CTCCTGGCGG TGACCATTCC GGAAGAGAAG GGCAGCTTCC TGAAATTCTG CCAACTGCTC GGCGGGCGTT CGGTCACCGA ATTTAACTAC CGTTTTGCCG ATGCCAAAGA CGCCTGTATT TTTGTTGGTG TGCGTTTGAG CCGTGGCGTA GAAGAGCGCA AAGAGATTCT GAGTTTGCTG CATGACGGCG GCTACAGCGT GGTGGATCTC TCCGATGATG AAATGGCGAA GCTACACGTA CGCTACATGG TCGGCGGACG CCCGTCTAAG CCGTTGCAGG AACGTTTGTT CAGCTTTGAG TTCCCGGAAT CACCGGGTGC GTTACTGAAA TTCCTTCATA CGCTGGGAAC GCACTGGAAC ATTTCCCTGT TCCATTATCG CAGCCACGGT ACGGATTTCG GACGCGTGCT GGCCGCCTTT GAGCTGGGCG AGCACGAGCC AGATTTCGAA ACGCGGCTGA ACGAGTTGGG CTACGATTGT CATGACGAAA CCCATAATCC GGCGTTCAGG TTCTTCCTGG CGGGCTAG
Protein sequence | MMAESQPLSA APDGAEYLRA VLRAPVYEAV QVTPLQKMDK LSSRLDNVIL VKREDRQPVH SFKLRGAYTM MAGLNEEQKA RGVITASAGN HAQGVAFSAA RLGVKALIVM PVATADIKVD AVRGFGGEVL LHGANFDEAK AKAIDLAQQQ GFTWVPPFDH PMVIAGQGTL ALELLQQDAH LDRIFVPVGG GGLAAGVAVL IKQLMPQIKV IAVEAEDSAC LKAALDAGHP VDLARVGLFA EGVAVKRIGD ETFRVCQEYL DDIITVDSDS ICAAMKDLFE DVRAVAEPSG ALALAGMKKY IAQHNIRGER LAHVLSGANV NFHGLRYVSE RCELGEQREA LLAVTIPEEK GSFLKFCQLL GGRSVTEFNY RFADAKDACI FVGVRLSRGV EERKEILSLL HDGGYSVVDL SDDEMAKLHV RYMVGGRPSK PLQERLFSFE FPESPGALLK FLHTLGTHWN ISLFHYRSHG TDFGRVLAAF ELGEHEPDFE TRLNELGYDC HDETHNPAFR FFLAG
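Sequences printed in space-separated blocks of ten, as above, usually need the whitespace removed before further use. The sketch below uses hypothetical helper names and a toy input rather than the full record. It joins the blocks and checks that the result looks like a complete coding sequence: a length divisible by three, ending in TAA, TAG or TGA (the stop codons of translation table 11).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize_sequence(grouped: str) -> str:
    """Remove all whitespace between blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of three and the last codon is a stop."""
    return len(dna) >= 6 and len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# Toy input for illustration only, not taken from the record.
toy = normalize_sequence("ATGGCC TAG")
print(toy, looks_like_complete_cds(toy))
```

Applied to the Gene sequence field, the same check would be expected to pass, since the record lists 1548 bp and the sequence ends in TAG.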