Gene Information |
Locus tag | SeD_A2828 |
Symbol | |
ID | 6873893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2697614 |
End bp | 2698801 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642785881 |
Product | ethanolamine utilization protein EutG |
Protein accession | YP_002216531 |
Protein GI | 198244858 |
COG category | [C] Energy production and conversion |
COG ID | [COG1454] Alcohol dehydrogenase, class IV |
TIGRFAM ID | |
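The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes a single terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 2697614
end_bp = 2698801
gene_length_bp = 1188
protein_length_aa = 395


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare derived values with the values listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under those assumptions, both comparisons should agree with the listed Gene Length and Protein Length; a mismatch would suggest a different coordinate convention or a spliced or partial feature.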
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 84 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCTG AACTACAGAC GGCGCTGTTT CAGGCATTCG ACACCCTGAA TCTGCAACGG GTGAAAACGT TCAGCGTACC GCCGGTCACG CTGTGCGGGC TTGGGGCGCT CGGCGCCTGC GGCCAGGAAG CGCAAGCGCG GGGCGTAAGC CATCTGTTTG TGATGGTCGA CAGCTTCCTG CATCAGGCGG GAATGACCGC GCCGCTGGCA CGCAGCCTGG CGATGAAAGG TGTGGCGATG ACGGTCTGGC CGTGTCCGCC AGGCGAGCCG TGCATTACCG ATGTCTGCGC GGCGGTGGCG CAACTGCGTG AGGCGGCGTG CGACGGCGTA GTGGCCTTTG GCGGCGGTTC GGTGCTGGAC GCGGCGAAAG CGGTCGCCCT GCTGGTGACT AACCCTGACC AGACGCTGAG CGCCATGACC GAGCACAGTA CATTACGCCC GCGTCTGCCG CTGATTGCGG TGCCGACCAC CGCCGGAACC GGTTCTGAAA CCACCAACGT GACGGTGATT ATCGACGCGG TCAGCGGGCG CAAGCAGGTA CTGGCACACG CGTCACTAAT GCCGGACGTG GCGATTCTTG ATGCTGCCGT GACCGAAGGC GTTCCGCCAA ACGTGACGGC GATGACCGGT ATCGATGCGT TGACGCATGC GATTGAGGCC TACAGCGCGC TCAACGCCAC GCCGTTTACC GACAGCCTGG CGATTGGCGC GATAGCGATG ATTGGCAAAT CGCTGCCGAA AGCCGTGGGG TACGGCCACG ATCTGGCGGC GCGTGAAAAT ATGTTGCTGG CCTCCTGTAT GGCGGGAATG GCCTTTTCCA GCGCCGGTTT GGGGCTGTGT CATGCGATGG CGCACCAGCC AGGAGCGGCG CTGCATATTC CGCACGGCCA GGCCAACGCC ATGCTGCTGC CAACAGTCAT GGGCTTTAAC CGGATGGTTT GCCGCGAGCG CTTCAGTCAA ATCGGTCGGG CGTTAACCAA TAAGAAATCG GACGATCGCG ATGCGATTGC GGCGGTGAGC GAGCTGATTG CCGAAGTGGG GCAGAGCAAA CGGCTGGCTG ACGCTGGCGC TAAACCCGAA CACTACAGCG CATGGGCGCA AGCCGCGCTG GAGGATATTT GTCTGCGCAG TAACCCACGC ACCGCCACAC AGGCACAGAT TATCGACCTG TACGCGGCTG CCGGGTAA
|
Protein sequence | MQAELQTALF QAFDTLNLQR VKTFSVPPVT LCGLGALGAC GQEAQARGVS HLFVMVDSFL HQAGMTAPLA RSLAMKGVAM TVWPCPPGEP CITDVCAAVA QLREAACDGV VAFGGGSVLD AAKAVALLVT NPDQTLSAMT EHSTLRPRLP LIAVPTTAGT GSETTNVTVI IDAVSGRKQV LAHASLMPDV AILDAAVTEG VPPNVTAMTG IDALTHAIEA YSALNATPFT DSLAIGAIAM IGKSLPKAVG YGHDLAAREN MLLASCMAGM AFSSAGLGLC HAMAHQPGAA LHIPHGQANA MLLPTVMGFN RMVCRERFSQ IGRALTNKKS DDRDAIAAVS ELIAEVGQSK RLADAGAKPE HYSAWAQAAL EDICLRSNPR TATQAQIIDL YAAAG
|
| |