Gene SNSL254_A2654 details


Gene Information

Locus tag: SNSL254_A2654
Symbol:
ID: 6482714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2569422
End bp: 2570609
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 63%
IMG OID: 642737987
Product: ethanolamine utilization protein EutG
Protein accession: YP_002041721
Protein GI: 194444080
COG category: [C] Energy production and conversion
COG ID: [COG1454] Alcohol dehydrogenase, class IV
TIGRFAM ID:
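
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2569422
end_bp = 2570609
gene_length_bp = 1188
protein_length_aa = 395


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions the record is internally consistent: 2570609 - 2569422 + 1 = 1188 bp, and 1188 / 3 - 1 = 395 aa.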


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0517138
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 83
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCTG AACTACAGAC GGCGCTGTTT CAGGCATTCG ACACCCTGAA TCTGCAACGG 
GTGAAAACGT TCAGCGTACC GCCGGTCACG CTGTGCGGAC TTGGGGCGCT CGGCGCCTGT
GGACAGGAAG CGCAAGCGCG AGGCGTAAGC CATCTGTTTG TGATGGTCGA CAGCTTCCTG
CATCAGGCGG GAATGACCGC GCCGCTGGCA CGCAGCCTGG CGATGAAAGG CGTGGCGATG
ACAGTCTGGC CGTGTCCGCC AGGCGAGCCG TGCATCACCG ATGTTTGCGC GGCGGTGGCG
CAACTGCGTG AGGCGGCGTG CGACGGCGTA GTGGCCTTTG GCGGCGGTTC GGTGCTGGAC
GCGGCGAAAG CGGTCGCCCT GCTGGTGACT AACCCTGACC AGACGCTGAG CGCCATGACC
GAGCGCAGTA CATTACGCCC GCGTCTGCCG CTGATTGCAG TGCCGACCAC CGCCGGAACC
GGTTCTGAAA CCACCAACGT GACGGTGATT ATCGACGCGG TCAGCGGGCG CAAGCAGGTG
CTGGCGCACG CGTCACTAAT GCCGGACGTG GCGATTCTTG ATGCTGCCGT GACCGAAGGC
GTTCCGCCAA ACGTGACGGC GATGACCGGT ATCGATGCGT TGACGCATGC GATTGAGGCC
TACAGCGCGC TCAACGCCAC GCCGTTTACC GACAGCCTGG CGATTGGCGC GATAGCGATG
ATTGGCAAAT CGCTGCCGAA AGCCGTGGGT TACGGCCACG ATCTGGCGGC GCGTGAAAAT
ATGTTGCTGG CCTCCTGTAT GGCGGGAATG GCCTTTTCCA GCGCCGGTTT GGGGCTGTGT
CATGCGATGG CGCACCAGCC TGGGGCGGCG CTGCATATTC CGCACGGCCA GGCCAACGCC
ATGCTGCTGC CAACAGTCAT GGGCTTTAAC CGGATGGTTT GCCGCGAGCG CTTCAGTCAA
ATCGGTCGGG CGTTAACCAA TAAGAAATCG GACGATCGCG ATGCGATTGC GGCGGTGAGC
GAGCTGATTG CCGAAGTGGG GCAGAGCAAA CGGCTGGCTG ATGCTGGCGC CAAACCCGAA
CACTACAGCG CGTGGGCGCA AGCCGCGCTG GAGGATATTT GTCTGCGCAG TAACCCACGC
ACCGCCACAC AGGCACAGAT TATCGACCTG TACGCGGCTG CCGGGTAA
 
Protein sequence
MQAELQTALF QAFDTLNLQR VKTFSVPPVT LCGLGALGAC GQEAQARGVS HLFVMVDSFL 
HQAGMTAPLA RSLAMKGVAM TVWPCPPGEP CITDVCAAVA QLREAACDGV VAFGGGSVLD
AAKAVALLVT NPDQTLSAMT ERSTLRPRLP LIAVPTTAGT GSETTNVTVI IDAVSGRKQV
LAHASLMPDV AILDAAVTEG VPPNVTAMTG IDALTHAIEA YSALNATPFT DSLAIGAIAM
IGKSLPKAVG YGHDLAAREN MLLASCMAGM AFSSAGLGLC HAMAHQPGAA LHIPHGQANA
MLLPTVMGFN RMVCRERFSQ IGRALTNKKS DDRDAIAAVS ELIAEVGQSK RLADAGAKPE
HYSAWAQAAL EDICLRSNPR TATQAQIIDL YAAAG
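
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the two ends of the gene. This is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ only in permitted start codons). Only the first 30 and last 48 bases of the gene sequence are used, and the `translate` helper is a hypothetical name introduced for illustration.

```python
# Codon assignments of the standard genetic code in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 and last 48 bases of the gene sequence, copied from this record.
first_30 = "ATGCAAGCTG AACTACAGAC GGCGCTGTTT"
last_48 = "ACCGCCACAC AGGCACAGAT TATCGACCTG TACGCGGCTG CCGGGTAA"

n_terminus = translate(first_30)
c_terminus = translate(last_48)

print(n_terminus)  # MQAELQTALF
print(c_terminus)  # TATQAQIIDLYAAAG
```

Both fragments match the start and end of the protein sequence listed above, and the final codon TAA is a stop codon, which is why the 1188 bp gene encodes 395 residues rather than 396.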