Gene SeD_A3572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3572 
Symbol 
ID6872400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3427849 
End bp3429417 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content50% 
IMG OID642786560 
Productmethyl-accepting chemotaxis protein II 
Protein accessionYP_002217196 
Protein GI198243367 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.258237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTTGC ATAACATTAA AATACGTTCA AAATTATTTA TGGCCTTTGG CTTATTCATT 
GTTCTCATGG TGGTGAGTTC CGCTCTGTCT TTGTTTAGCC TTGATCGGGC TAATACGGGT
ATGCAGGACA TTATTACCAA TGATTATCCC ACCACGGTAA AAGCCAATCT GTTAATCGAT
AATTTTAATG ATTTCATCAT CGCGCAGCAG CTCATGTTAC TGGATGAAGA GGGGCGCTGG
AGCCAGAGCT CGCAGAAAGA ACTCAGTGAG ATAAGCCAGC GCATTTCGGC GCTACTGGAT
GAGCTTTCCA GGGAAAATAG TCACGATGCG GATTCACAGA AAATCATTAA TGAGATCCGT
GAAGCGCGCC AGCAATACCT GGAGTCCAGA TTCCGTATTT TGAAAGATAT TCAAAGCAAT
AATCGTCAGG CGGCCATTCA GGAGATGATG ACCAGAACGG TGCAGGTGCA AAAAGTCTAT
AAGGACAAAG TCCAGGAACT TATCGCTGTT CAGGACGCGC AGATGCATGA AGCGAGCGTG
CAGGTCAAAG AGGATTTTAA AAATAATCGG ACGCTGTTAA TCACTTTGGC GCTGATAAGC
ATCGCCGCCG GAGGCGTAAT GGGATGGTAT ATTGTGCGTT CTATTACCCG GCCGCTTGAT
GACGCAGTAC GCTTTGCCGA GGCGATTGCC GATGGCGATC TGACTCGCCA TATCACCACC
GACTATAAAG ATGAAACAGG CGTACTACTG CAAGCGTTAA TGGCGATGAA AACGCGTCTA
CTGGATATCG TACAGGAAGT GCAAAACGGT TCGGAAAGTA TCTCCACAGC GGCGGCGCAA
ATTGTCGCCG GTAACCAGGA TTTGGCGGCG CGTACGGAAG AGCAGGCCAG CTCGGTTGAA
GAAACGGCGG CGTCGATGGA ACAGATTACC GCCACGGTTA AAAATACGGC TGACCATACC
AGTGAAGCGA CCAAACTCTC TGCCGGCGCC GCCAGCGTAG TGAAAAACAA TGGGGAGATG
ATGAATCAAG TGACGCAGAA AATGCGCGTC ATTAACGATA CGGCAAATCG TATGTCGGAT
ATCATCAATA TCATTGATTC CATTGCCTTT CAGACCAATA TTCTGGCGCT GAACGCGGCG
GTTGAAGCGG CGCGCGCGGG CGAACATGGA CGTGGTTTTG CCGTTGTCGC CGGAGAGGTT
CGCCAGTTGG CGCAAAAGAG CGCCTCGTCA GCCAGTGAAA TCCGTAATTT GATTGAAGAT
TCAACCAGTC AGACTCAGGA AGGGATGCAC CTGGTGGAGA AAGCCAGCGC CCTGATTAAT
GGCATGGTGG ATAACGTCGA AGAGATGGAT GTGATATTAC GTGAGATTGG GCAGGCCAGC
CGCGAGCAAA CTGACGGTAT TTCGCAGATT AACAGCGCGA TTGGCCTGAT TGACGCCGCC
ACGCAACAAA ACTCCTGCCT TGTGGAAGAG TCTGTTGCCG CCGCGGCGTC GCTGAACGAA
CAGGCGTTAC ATTTAAAAGA GCTGGTTAAC GTGTTCCGCG TCCGCGAAGA GGACACGCAG
CCCGCTTAA
 
Protein sequence
MFLHNIKIRS KLFMAFGLFI VLMVVSSALS LFSLDRANTG MQDIITNDYP TTVKANLLID 
NFNDFIIAQQ LMLLDEEGRW SQSSQKELSE ISQRISALLD ELSRENSHDA DSQKIINEIR
EARQQYLESR FRILKDIQSN NRQAAIQEMM TRTVQVQKVY KDKVQELIAV QDAQMHEASV
QVKEDFKNNR TLLITLALIS IAAGGVMGWY IVRSITRPLD DAVRFAEAIA DGDLTRHITT
DYKDETGVLL QALMAMKTRL LDIVQEVQNG SESISTAAAQ IVAGNQDLAA RTEEQASSVE
ETAASMEQIT ATVKNTADHT SEATKLSAGA ASVVKNNGEM MNQVTQKMRV INDTANRMSD
IINIIDSIAF QTNILALNAA VEAARAGEHG RGFAVVAGEV RQLAQKSASS ASEIRNLIED
STSQTQEGMH LVEKASALIN GMVDNVEEMD VILREIGQAS REQTDGISQI NSAIGLIDAA
TQQNSCLVEE SVAAAASLNE QALHLKELVN VFRVREEDTQ PA