Gene SeHA_C4157 details


Gene Information

Locus tag: SeHA_C4157
Symbol: torT
ID: 6491112
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 4039539
End bp: 4040579
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 55%
IMG OID: 642744252
Product: TMAO reductase system periplasmic protein TorT
Protein accession: YP_002047856
Protein GI: 194451047
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR02955] TMAO reductase system periplasmic protein TorT
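
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below is only an illustration of that relationship, not part of the database record. It assumes 1-based inclusive coordinates, an unspliced CDS, and a gene length that includes the stop codon; the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose length includes the stop codon."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(4039539, 4040579)
print(length_bp)                  # 1041
print(protein_length(length_bp))  # 346
```

Under these assumptions the computed values agree with the Gene Length (1041 bp) and Protein Length (346 aa) fields listed above.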


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCGC TTATTTCGTT CTTTTTTTTG ATAATCATCG TCTCGAATGT AGAAACGGCG 
TCAGCCGAAA CCGGGCTGTT ACACTGGACG CGCGCCGATC GGGCGATTCC CTGGCGGCAA
ACGTCAGTCA ATGCGAGTAA ACCCTGGAAA CTGTGCGCAT TATACCCTAG CCTGAAAGAT
TCTTACTGGT TGTCCGTCAA CTACGGGATG CAAAAAGCGG CCAAATTGTA TGGCGTTGAT
TTAAAAGTGC TGGAAGCGAA TGGCTATCGG CAACTGGCGA CGCAACAACA GCAAATGACG
CAGTGCCGGG AGTGGGGCGC CGACGCTATT CTGCTTGGCA GCAGTACCGA TCGTTTCCCG
GAACTGGAAC GGTATGCCGG TAATGTGCCG GTTATTGAAC TGGTGAACAT GATTCACGAT
GCCAGCGTCG CAACCCGCAT TGGTCTGCCG TGGTTTCAGA TGGGGTATCT GCCGGGACGT
TTCCTGGTGC AGTGGAGCAA AGGAAAAGCG CTTAACGTTT TACTCTTCCC GGGGCCGGAA
GAAGCCGGCG GCAGTCAGGA GATGGTGGCG GGTTTTCGTC AGGCAATTAA AGGCAGCGCC
ATCAACATTG TGGATATCGC CTGGGGCGAT AACGACATTG AAGTGCAGCG AAACTTACTC
CAGGAAATGC TGGAGCGCCA TCCGGACGCC AATGTGGTCG CCGGGTCGGC GATAGCGGCG
GAGGCGGCGA TGGGCGAAGG GCGTAATTTG ACGACTCCGC TCACGATCGT CTCATTTTAT
CTGACGCATC AGGTTTATCG CGGTTTAAAA CGCGGCCATA TCCTGATGGC GCTCAGCGAT
CAGATGGCCT GGCAGGGAGA ATTAGCGATA ACGCAGTCGA TTAAGGTCTT ACAGGGGCAG
CCGGTGCCTG AAAATATCAG CCCGCCGGTG CTTATTTTGA CGCATAACAA CGCCGACAGC
GCGCGCGTTC GCCGTTCGCT ATCGCCTCCG GGATTTCGGC CCGTCTATCT GTATCAATAC
ACCTCCGAGG CTAAAAAGTA G
 
Protein sequence
MRALISFFFL IIIVSNVETA SAETGLLHWT RADRAIPWRQ TSVNASKPWK LCALYPSLKD 
SYWLSVNYGM QKAAKLYGVD LKVLEANGYR QLATQQQQMT QCREWGADAI LLGSSTDRFP
ELERYAGNVP VIELVNMIHD ASVATRIGLP WFQMGYLPGR FLVQWSKGKA LNVLLFPGPE
EAGGSQEMVA GFRQAIKGSA INIVDIAWGD NDIEVQRNLL QEMLERHPDA NVVAGSAIAA
EAAMGEGRNL TTPLTIVSFY LTHQVYRGLK RGHILMALSD QMAWQGELAI TQSIKVLQGQ
PVPENISPPV LILTHNNADS ARVRRSLSPP GFRPVYLYQY TSEAKK
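
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, not the tool the database used. It assumes the sequence is given on its coding strand and starts with ATG; table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its set of alternative start codons, which this sketch does not handle. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering
# (identical for the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence above
print(translate("ATGCGCGCGC TTATTTCG"))  # MRALIS
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence once its spaces are removed.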