Gene SeSA_A4036 details

Gene Information

Locus tag: SeSA_A4036
Symbol: torT
ID: 6517202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 3907122
End bp: 3908162
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 55%
IMG OID: 642749006
Product: TMAO reductase system periplasmic protein TorT
Protein accession: YP_002116768
Protein GI: 194736586
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR02955] TMAO reductase system periplasmic protein TorT
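
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. The function names here are illustrative and not part of the database. The sketch assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    length = gene_length(3907122, 3908162)
    print(length, protein_length(length))  # prints: 1041 346
```

Both values agree with the Gene Length and Protein Length fields of this record.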


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCGC TTATTTCGTT CTTTTTTTTG ATAATCATCG TCTCGAATGT AGAAACGGCG 
TCAGCCGAAA CCGGACTGCT ACACTGGACG CGCGCCGATC GGGCGATTCC CTGGCGGCAA
ACGTCAGTCA ATGCGAGTAA ACCCTGGAAA CTGTGCGCAT TATACCCCAG CCTGAAAGAT
TCTTACTGGT TGTCCGTCAA CTACGGGATG CAAAAAGCGG CCAAATTGTA TGGCGTTGAT
TTAAAAGTGC TGGAAGCGAA TGGCTATCGG CAACTGGCGA CGCAACAACA GCAAATGATG
CAGTGCCGGG AGTGGGGTGC CGACGCTATT CTGCTTGGCA GCAGTACCGA TCGTTTCCCG
GAGCTGGAAC GGTATGCCGG TAATGTGCCG GTTATTGAAC TGGTGAACAT GATTCACGAT
GCCAGCGTCG CAACCCGCGT TGGTCTGCCG TGGTTTCAGA TGGGGTATCT GCCGGGACGT
TTCCTGGTGC AGTGGAGCAA AGGAAAAGCG CTTAACGTTT TACTCTTCCC GGGGCCGGAA
GAAGCCGGCG GCAGTCAGGA GATGGTGGCG GGTTTTCGTC AGGCAATTAA AGGCAGCGCG
ATCAACATTG TGGATATCGC CTGGGGCGAT AACGACATTG AAGTGCAGCG AAACTTACTC
CAGGAAATGC TGGAGCGCCA TCCGGACGCC AATGTGGTCG CCGGGTCGGC GATAGCAGCG
GAGGCGGCGA TGGGCGAAGG GCGAAATTTG ACGACTCCGC TCACGATCGT CTCATTTTAT
CTGACGCATC AGGTTTATCG CGGTTTAAAA CGCGGCCATA TCCTGATGGC GCTCAGCGAT
CAGATGGCCT GGCAGGGAGA ATTAGCGATA ACGCAGTCGA TTAAGGTCTT ACAGGGGCAA
CCGGTGCCTG AAAATATCAG CCCGCCGGTG CTTATTTTGA CGCATAACAA CGCCGACAGC
GCGCGCGTTC GCCGTTCGCT ATCGCCTCCG GGATTTCGGC CCGTCTATCT GTATCAATAC
ACCTCCGAGG CTAAAAAGTA G
 
Protein sequence
MRALISFFFL IIIVSNVETA SAETGLLHWT RADRAIPWRQ TSVNASKPWK LCALYPSLKD 
SYWLSVNYGM QKAAKLYGVD LKVLEANGYR QLATQQQQMM QCREWGADAI LLGSSTDRFP
ELERYAGNVP VIELVNMIHD ASVATRVGLP WFQMGYLPGR FLVQWSKGKA LNVLLFPGPE
EAGGSQEMVA GFRQAIKGSA INIVDIAWGD NDIEVQRNLL QEMLERHPDA NVVAGSAIAA
EAAMGEGRNL TTPLTIVSFY LTHQVYRGLK RGHILMALSD QMAWQGELAI TQSIKVLQGQ
PVPENISPPV LILTHNNADS ARVRRSLSPP GFRPVYLYQY TSEAKK
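
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch rather than the pipeline used by the database: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids as the standard code (the two differ only in permitted start codons). The `clean` helper strips the 10-base spacing used in the listing above.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 and last 21 bases of the gene sequence in this record.
    print(translate("ATGCGCGCGC TTATTTCGTT CTTTTTTTTG"))  # prints: MRALISFFFL
    print(translate("ACCTCCGAGG CTAAAAAGTA G"))  # prints: TSEAKK
```

The two fragments translate to the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked the same way.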