Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2654 |
Symbol | |
ID | 6482714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2569422 |
End bp | 2570609 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642737987 |
Product | ethanolamine utilization protein EutG |
Protein accession | YP_002041721 |
Protein GI | 194444080 |
COG category | [C] Energy production and conversion |
COG ID | [COG1454] Alcohol dehydrogenase, class IV |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0517138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 83 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCTG AACTACAGAC GGCGCTGTTT CAGGCATTCG ACACCCTGAA TCTGCAACGG GTGAAAACGT TCAGCGTACC GCCGGTCACG CTGTGCGGAC TTGGGGCGCT CGGCGCCTGT GGACAGGAAG CGCAAGCGCG AGGCGTAAGC CATCTGTTTG TGATGGTCGA CAGCTTCCTG CATCAGGCGG GAATGACCGC GCCGCTGGCA CGCAGCCTGG CGATGAAAGG CGTGGCGATG ACAGTCTGGC CGTGTCCGCC AGGCGAGCCG TGCATCACCG ATGTTTGCGC GGCGGTGGCG CAACTGCGTG AGGCGGCGTG CGACGGCGTA GTGGCCTTTG GCGGCGGTTC GGTGCTGGAC GCGGCGAAAG CGGTCGCCCT GCTGGTGACT AACCCTGACC AGACGCTGAG CGCCATGACC GAGCGCAGTA CATTACGCCC GCGTCTGCCG CTGATTGCAG TGCCGACCAC CGCCGGAACC GGTTCTGAAA CCACCAACGT GACGGTGATT ATCGACGCGG TCAGCGGGCG CAAGCAGGTG CTGGCGCACG CGTCACTAAT GCCGGACGTG GCGATTCTTG ATGCTGCCGT GACCGAAGGC GTTCCGCCAA ACGTGACGGC GATGACCGGT ATCGATGCGT TGACGCATGC GATTGAGGCC TACAGCGCGC TCAACGCCAC GCCGTTTACC GACAGCCTGG CGATTGGCGC GATAGCGATG ATTGGCAAAT CGCTGCCGAA AGCCGTGGGT TACGGCCACG ATCTGGCGGC GCGTGAAAAT ATGTTGCTGG CCTCCTGTAT GGCGGGAATG GCCTTTTCCA GCGCCGGTTT GGGGCTGTGT CATGCGATGG CGCACCAGCC TGGGGCGGCG CTGCATATTC CGCACGGCCA GGCCAACGCC ATGCTGCTGC CAACAGTCAT GGGCTTTAAC CGGATGGTTT GCCGCGAGCG CTTCAGTCAA ATCGGTCGGG CGTTAACCAA TAAGAAATCG GACGATCGCG ATGCGATTGC GGCGGTGAGC GAGCTGATTG CCGAAGTGGG GCAGAGCAAA CGGCTGGCTG ATGCTGGCGC CAAACCCGAA CACTACAGCG CGTGGGCGCA AGCCGCGCTG GAGGATATTT GTCTGCGCAG TAACCCACGC ACCGCCACAC AGGCACAGAT TATCGACCTG TACGCGGCTG CCGGGTAA
|
Protein sequence | MQAELQTALF QAFDTLNLQR VKTFSVPPVT LCGLGALGAC GQEAQARGVS HLFVMVDSFL HQAGMTAPLA RSLAMKGVAM TVWPCPPGEP CITDVCAAVA QLREAACDGV VAFGGGSVLD AAKAVALLVT NPDQTLSAMT ERSTLRPRLP LIAVPTTAGT GSETTNVTVI IDAVSGRKQV LAHASLMPDV AILDAAVTEG VPPNVTAMTG IDALTHAIEA YSALNATPFT DSLAIGAIAM IGKSLPKAVG YGHDLAAREN MLLASCMAGM AFSSAGLGLC HAMAHQPGAA LHIPHGQANA MLLPTVMGFN RMVCRERFSQ IGRALTNKKS DDRDAIAAVS ELIAEVGQSK RLADAGAKPE HYSAWAQAAL EDICLRSNPR TATQAQIIDL YAAAG
|
| |