Gene SNSL254_A4106 details

Gene Information

Locus tag: SNSL254_A4106
Symbol: torC
ID: 6486013
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3993936
End bp: 3995120
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 53%
IMG OID: 642739362
Product: trimethylamine-N-oxide reductase c-type cytochrome TorC
Protein accession: YP_002043071
Protein GI: 194444473
COG category: [C] Energy production and conversion
COG ID: [COG3005] Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit
TIGRFAM ID: [TIGR02162] trimethylamine-N-oxide reductase c-type cytochrome TorC
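
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes one stop codon; the record states neither convention explicitly. The function names are hypothetical, chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3993936, 3995120)
print(length_bp)                  # 1185
print(protein_length(length_bp))  # 394
```

Under these assumptions both values agree with the Gene Length (1185 bp) and Protein Length (394 aa) fields of this record.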


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAAAC TCTGGAGAGC GCTATTCAGG CCAAGCGCCC GTTGGTCGAT ACTGGCGCTG 
GTCATTGTCG GGATTGTGAT CGGCGTTGCG CTGATCGTGT TACCTCACGT CGGTATCAAA
CTGACCAGTA CGACAGAGTT TTGCGTCAGC TGCCATAGTA TGCAGCCGGT GTATCAGGAA
TATAAACAGT CCGTACATTT CCAGAACGCT TCCGGCGTGC GCGCGGAATG CCACGATTGC
CATATCCCCC CTGATATTCC AGGCATGGTG AAACGTAAGC TGGAAGCCAG CAACGATCTT
TACCAGACAT TTATCGCCCA CTCGATTGAT ACCCCGGAAA AATTTGAAGC CAAACGCGCC
GAGCTTGCCG AGCGTGAATG GGCGCGCATG AAAGAGAATA ACTCCGCGAC CTGTCGTTCC
TGCCATAACT ACGATGCGAT GGATCACGCG AAACAGAACC CGGAAGCGGC GCGGCAAATG
AAAATCGCCG CGAAAGAAAA TCAGTCCTGC ATCGACTGTC ATAAAGGGAT TGCCCACCAG
CTACCGGATA TGAGCAGCGG ATTCCGCAAA CAGTTTGATG AACTGCGCGC CAGCGCCAGT
ACGCATAATG ACGGCGATAC GCTCTATTCG CTGGATATCA AGCCGATTTA CGCCGCTAAA
GGCGATAAAG AACCGGCAGG TTCGCTGTTA CCTGCTTCTG AAGTGAAAGT CCTTAAACGG
GACGGTGACT GGCTGCAAGT GCAAATCGAA GGCTGGACGG AGACGGACGG TCGCCAGCGC
GTGCTGACGC AGTTGCCCGG TAAACGTATT TTTGTCGCCT CGATTCGCGG CGATGTGCAA
CAGCATGTGA AAACGCTGGA AGAGACCACC GTCGCGGCGA CTAATACTCA GTGGAGCAAA
TTACAGGCAA CCGCGTGGAT GCAAAAAGGC GACATGGTAA ATGACATTAA ACCGATTTGG
GCCTATGCCG ACTCCCTCTA TAACGGCACC TGTAATCAGT GTCACGGCGC GCCGGACAAA
GCGCACTTTG ACGCTAACGG CTGGATCGGC ACGCTCAACG GCATGATCGG TTTCACCAGT
CTGGATAAGC GTGAAGAACG TACCTTGTTG AAATATCTCC AGATGAATGC GTCTGATACC
ACCAACACGC CGCACAGCGA TAAGGGAGAA AACAATGAAA AATAA
 
Protein sequence
MRKLWRALFR PSARWSILAL VIVGIVIGVA LIVLPHVGIK LTSTTEFCVS CHSMQPVYQE 
YKQSVHFQNA SGVRAECHDC HIPPDIPGMV KRKLEASNDL YQTFIAHSID TPEKFEAKRA
ELAEREWARM KENNSATCRS CHNYDAMDHA KQNPEAARQM KIAAKENQSC IDCHKGIAHQ
LPDMSSGFRK QFDELRASAS THNDGDTLYS LDIKPIYAAK GDKEPAGSLL PASEVKVLKR
DGDWLQVQIE GWTETDGRQR VLTQLPGKRI FVASIRGDVQ QHVKTLEETT VAATNTQWSK
LQATAWMQKG DMVNDIKPIW AYADSLYNGT CNQCHGAPDK AHFDANGWIG TLNGMIGFTS
LDKREERTLL KYLQMNASDT TNTPHSDKGE NNEK
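
Both sequences are printed in blocks of 10 characters, 60 per line, so the spacing has to be stripped before the text is used elsewhere. The following is a minimal sketch of that cleanup together with a GC-percentage helper; `clean_sequence` and `gc_percent` are hypothetical names, not part of any database tooling.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGCGAAAAC TCTGGAGAGC GCTATTCAGG CCAAGCGCCC GTTGGTCGAT ACTGGCGCTG "
last_line = "ACCAACACGC CGCACAGCGA TAAGGGAGAA AACAATGAAA AATAA"

print(len(clean_sequence(first_line)))   # 60
print(clean_sequence(first_line)[:3])    # ATG
print(clean_sequence(last_line)[-3:])    # TAA
```

Passing the complete gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field above, and the length of the cleaned string can be compared with the Gene Length field.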