Gene SNSL254_pSN254_0098 details

Gene Information

Locus tag: SNSL254_pSN254_0098
Symbol: traW
ID: 4929529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_009140
Strand:
Start bp: 84789
End bp: 86054
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 53%
IMG OID: 642572397
Product: type IV conjugative transfer system protein TraW
Protein accession: YP_001101972
Protein GI: 134047266
COG category:
COG ID:
TIGRFAM ID: [TIGR02742] type-F conjugative transfer system pilin assembly protein TrbC
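
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the CDS ends in a single stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the record above.
start_bp = 84789
end_bp = 86054
gene_length_bp = 1266
protein_length_aa = 421


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_len):
    """Nucleotides needed for protein_len residues plus one stop codon."""
    return (protein_len + 1) * 3


# Both values should agree with the listed gene length.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_length(protein_length_aa) == gene_length_bp)
```

Under these assumptions both checks agree with the listed gene length of 1266 bp.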


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.122831
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGATCAGAT CATTGGCCGC GCATATCCCC TGTTCTAAGA GCGTTCTGGT GGCGTTGATG 
TTCTCTGTGG CTGGCGGGGC ATACGCTCAA GAGTCTCCGC TCACAGAGCA GGATAAGGCG
CTTATTGAGC AAGGAAAGCA AATTGCCCAA AAGGCCCAGA AGATGGAAAT GCCATCTCTG
TTGCAAAACC AACACATGGA CGAGGCTCAG GCCGAAGCCA AGGCATTTTT CAAGCAGCTC
CAAACTACTA ACCCAACGCT CAAGGAGATG CACCGGAAAC AGGCTGAAAA GGGTATCTAC
TCTGACCATC GGATACTGGT TTTCGCCTCG TTGTCTCTTG GCGAACAGGG GTTAGATGAC
GTCCTAACGG CGGTGTCAGG CCAGCCTGAT TCTGTAATTG TGTTCCGTGG CATCCCGGAA
GGAATGAACT TGGGGCAGGG AGTTAAAGCT ATTCAGGCGC TCGCGGCCAA AAAAGACCCA
GTGCCGAACA TCATCATCAA CCCTACGTTG TTCAAAACGT ACAACATCAC AGCCGTTCCC
ACGATTGTGA TGCTGGAGGA TGAGCCGCTG CCTGGCGAAC AACCAAACGT CGTCGCCCAG
GTCTCCGGGT TGTCCGACCC GGTATGGTTG GCTCGGGAAG TGGATAACGG AGAAAAAGGC
GATCTCGGCG TTAAGGGGCC GGTGGAGAAA ATCAGTGAGC CAGACCTTAT TGATGTTGCC
AAGAAACGCC TTGCCAATAT CGACTGGGAA GAGAAGAAGA AACAGGCTAT AGAGCGCTTC
TGGACCAAGC AGAATTTCAA TGAGCTGCCA AGAGCGCCAA AATCTCGAAC ACGAGAAATT
GACCCTAGCG TCATGATCAC CAGTGACATC AGCACTCCGG ATGGCACTGT GTTCGCTCAC
GCGGGTGACG TGATCAACCC ATTGTGCGAT CCGAAGGAAG TTTGCAAGCC TGGAACGCGG
CCATTTACCC AAGCGGTCGT AGTTTTCGAC CCGCTGGACA AAAAGCAAAT GGAACTACTC
GCCAAGAAGC TGCCTGAAAT CAAGCTGGAG CCTGGCGTAC AACGGATCAC CTATATCGCC
ACAGAGTTCG ACAAAGACAA AGGCTGGGAT TCCTACAAGA GTGTCACCGA CAACTTTGAC
GCGCCGGTAT ATCTGCTGAC GCCAGATCTG ATTACCCGGT TCGAGCTGGA GCACACACCG
AGCGTCATTA CTGCCAGAGG CAAGAAGTTT GTTGTCCGCG AACTTGCTGA GGAGGGCGGT
GAATGA
 
Protein sequence
MIRSLAAHIP CSKSVLVALM FSVAGGAYAQ ESPLTEQDKA LIEQGKQIAQ KAQKMEMPSL 
LQNQHMDEAQ AEAKAFFKQL QTTNPTLKEM HRKQAEKGIY SDHRILVFAS LSLGEQGLDD
VLTAVSGQPD SVIVFRGIPE GMNLGQGVKA IQALAAKKDP VPNIIINPTL FKTYNITAVP
TIVMLEDEPL PGEQPNVVAQ VSGLSDPVWL AREVDNGEKG DLGVKGPVEK ISEPDLIDVA
KKRLANIDWE EKKKQAIERF WTKQNFNELP RAPKSRTREI DPSVMITSDI STPDGTVFAH
AGDVINPLCD PKEVCKPGTR PFTQAVVVFD PLDKKQMELL AKKLPEIKLE PGVQRITYIA
TEFDKDKGWD SYKSVTDNFD APVYLLTPDL ITRFELEHTP SVITARGKKF VVRELAEEGG
E
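
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the method used to produce this record: it assumes the sense-codon assignments of translation table 11, which match the standard genetic code, and it does not model alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical. Only the first 30 bases and the final 6 bases of the gene sequence are used as input.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGATCAGAT CATTGGCCGC GCATATCCCC"))  # MIRSLAAHIP
# Final six bases: one glutamate codon followed by the TGA stop codon.
print(translate("GAATGA"))  # E
```

The outputs correspond to the first ten residues and the final residue of the protein sequence. Because this gene begins with ATG, the alternative initiation codons permitted by table 11 do not affect the result here.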