Gene SeSA_A4034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A4034 
SymboltorC 
ID6519262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp3905143 
End bp3906327 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content53% 
IMG OID642749004 
Producttrimethylamine-N-oxide reductase c-type cytochrome TorC 
Protein accessionYP_002116766 
Protein GI194737058 
COG category[C] Energy production and conversion 
COG ID[COG3005] Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit 
TIGRFAM ID[TIGR02162] trimethylamine-N-oxide reductase c-type cytochrome TorC 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAAAC TCTGGAGAGC GCTACTCAGG CCAAGCGCCC GTTGGTCGAT ACTGGCGCTG 
GTCATTGTCG GGATTGTGAT CGGCGTTGCG CTGATCGTGT TACCTCACGT CGGTATCAAA
CTGACCAGTA CGACAGAGTT TTGCGTCAGC TGCCATAGTA TGCAGCCGGT GTATCAGGAA
TATAAACAGT CCGTACATTT CCAGAACGCT TCCGGCGTAC GCGCGGAATG TCACGATTGC
CATATCCCCC CTGATATTCC AGGCATGGTG AAACGTAAGC TGGAAGCCAG CAACGATCTT
TACCAGACAT TTATCGCCCA CTCGATTGAT ACCCCGGAAA AATTTGAAGC CAAACGTGCC
GAGCTTGCCG AGCGTGAATG GGCGCGCATG AAAGAGAATA ACTCCGCGAC CTGTCGTTCC
TGCCATAACT ACGATGCGAT GGATCACGCG AAACAGAACC CGGAAGCGGC GCGGCAAATG
AAAATCGCCG CGAAAGAAAA TCAGTCCTGC ATCGACTGCC ATAAAGGGAT TGCCCACCAG
CTACCGGATA TGAGCAGCGG TTTCCGCAAA CAGTTTGATG AACTGCGCGC CAGCGCCAGT
ACGCATAATG ACGGCGATAC GCTCTATTCG CTGGATATCA AGCCGATTTA CGCCGCTAAA
GGCGATAAAG AACCGGCAGG TTCGTTGTTA CCTGCTTCTG AAGTGAAAGT CCTTAAACGG
GACGGTGACT GGCTGCAAGT GCAAATCGAA GGCTGGACGG AGACGGACGG TCGTCAGCGC
GTGCTGACGC AGTTGCCCGG TAAACGTATT TTTGTCGCTT CGATTCGCGG CGATGTGCAA
CAGCATGTGA AAACGCTGGA AGAGACCACC GTCGCGGCGA CCAATACTCA GTGGAGCAAA
TTACAGGCAA CCGCGTGGAT GCAAAAAGGC GACATGGTAA ATGACATTAA ACCGATTTGG
GCCTATGCCG ACTCCCTCTA TAACGGCACC TGTAATCAGT GTCACGGCGC GCCGGACAAA
GCGCACTTTG ACGCTAACGG CTGGATCGGC ACGCTCAACG GCATGATCGG TTTCACCAGT
CTGGATAAGC GTGAAGAACG TACCTTGTTG AAATATCTCC AGATGAATGC GTCTGATACC
ACCAATACGC CGCACAGCGA TAAGGGAGAA CACAATGAAA AATAA
 
Protein sequence
MRKLWRALLR PSARWSILAL VIVGIVIGVA LIVLPHVGIK LTSTTEFCVS CHSMQPVYQE 
YKQSVHFQNA SGVRAECHDC HIPPDIPGMV KRKLEASNDL YQTFIAHSID TPEKFEAKRA
ELAEREWARM KENNSATCRS CHNYDAMDHA KQNPEAARQM KIAAKENQSC IDCHKGIAHQ
LPDMSSGFRK QFDELRASAS THNDGDTLYS LDIKPIYAAK GDKEPAGSLL PASEVKVLKR
DGDWLQVQIE GWTETDGRQR VLTQLPGKRI FVASIRGDVQ QHVKTLEETT VAATNTQWSK
LQATAWMQKG DMVNDIKPIW AYADSLYNGT CNQCHGAPDK AHFDANGWIG TLNGMIGFTS
LDKREERTLL KYLQMNASDT TNTPHSDKGE HNEK