Gene SeHA_C3511 details


Gene Information

Locus tag: SeHA_C3511
Symbol:
ID: 6487845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3405114
End bp: 3406634
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 55%
IMG OID: 642743639
Product: aerotaxis receptor
Protein accession: YP_002047253
Protein GI: 194447584
COG category: [N] Cell motility
              [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
        [COG2202] FOG: PAS/PAC domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
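
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself.

```python
# Values copied from the Gene Information record.
start_bp = 3405114
end_bp = 3406634
gene_length_bp = 1521
protein_length_aa = 506

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the
# reported protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three figures agree: 1521 bp is 507 codons, which is 506 residues plus one stop codon.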


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.00000000402751
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCTTCTC ATCCCTACGT CAGCCAGCTA AATACCCCGC TGGATGATGA TACCACTCTG 
ATGTCTACGA CCGACCTGGA AAGCTATATC ACTCACGCCA ATGACACTTT TGTCCAAGTG
AGCGGCTATC AGTTAAACGA GTTACTGGCG CAGCCACATA ATCTGGTGCG TCATCCGGAT
ATGCCGAAAG CTGCCTTCGC AGATATGTGG TACACCCTAA AACAGGGCGA ACCGTGGAGC
GGCATTGTGA AAAACCGGCG TAAAAACGGC GACCATTATT GGGTGCGGGC CAACGCGGTA
CCGATGATAC GTGAAGGGCG TGTGACTGGA TATATGTCGA TCCGTACCCG CGCCACGGAT
GATGAGGTTG CCGCCGTCGA GCCTTTATAT CAGGCGCTAA ATGAAGGGCG GTGTAGTAAA
CGAATTCATA AAGGCCTGGT GGTTCGTCAG GGTTTGCTGG GCAAACTGCC CGCTATGCCT
GTTCGCTGGC GAGTGCGTAG CATTATGGGG CTAATGGCCG TAATGCTGGC GTTGGCGTTG
TTCGGTACGG ATGCCTCATG GCAGGCGTTG CTGTTGGGCG CGTTGGCGAT GCTGGCAGGT
ACGGCGCTAT TGGAATGGCA AATTGTGCGT CCCATTGAAA ATGTGGCGAC GCAGGCGCTG
AAAGTGGCGA CCGGCGAACG CAACAGCGTA CAACATCTTA ATCGTAGCGA TGAGTTGGGG
CTGACGCTGA GGGCCGTGGG GCAGCTTGGC TTGATGTGCC GCTGGCTGAT CAATGACGTA
TCAAGTCAGG TTTCCAGCGT CAGAAACGGC AGTGAAAGGC TGGCGAAGGG TAATAATGAT
CTGAACGAAC ACACCCGTCA GACCGTGGAG AATGTTCAGG AAACGGTAAC GACCATGAAC
CAGATGGCGG AGTCCGTGAA GCTCAATTCC GAGACGGCTT CCGCTGCGGA TAAGCTTTCC
ATGGCGGCCA GTAGCGCGGC GACTCAGGGA GGTGAGGCGA TGGATACGGT GATTAAAACG
ATGGATGATA TCGCTCACAG TACGCAACGT ATCGGGACGA TCACCACGCT AATTAACGAT
ATCGCTTTTC AGACGAATAT CCTGGCGCTG AATGCGGCGG TAGAAGCGGC GAGAGCGGGC
GAGCAGGGGA AAGGGTTTGC CGTGGTTGCT GGCGAGGTAC GCCATCTTGC CAGCCGCAGC
GCTAATGCGG CGAACGATAT TCGTAAATTA ATTGATGCCA GCGCAACAAA GGTGCAGTCA
GGCTCCGAGC AGGTTCACGC CGCAGGCCGT ACCATGGATG ACATTGTAGC TCAGGTGCAA
AATGTCACCC TGCTTATCGC ACGTATCAGT CAGTCGACGC AGGAACAGAC AGATGGGCTT
TCCAGCCTGA CCCGCGCCGT GGACGAGTTG AACCGCATAA CCCAGAAAAA TGCGGCGCTG
GTGGAAGAGA GCGCACAAGT CTCCGCAATG GTAAAACACC GTGCCAGCCG GCTGGAGGAT
GCGGTCACGG TACTGCATTA A
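
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before its length or GC content can be checked against the record (1521 bp, 55%). A minimal sketch follows. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers and not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCTTCTC ATCCCTACGT CAGCCAGCTA AATACCCCGC TGGATGATGA TACCACTCTG"

print(len(clean_sequence(first_line)))  # 60 bases in six blocks of ten
print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence to `gc_fraction` instead of `first_line` gives the value to compare with the GC content listed in the record.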
 
Protein sequence
MSSHPYVSQL NTPLDDDTTL MSTTDLESYI THANDTFVQV SGYQLNELLA QPHNLVRHPD 
MPKAAFADMW YTLKQGEPWS GIVKNRRKNG DHYWVRANAV PMIREGRVTG YMSIRTRATD
DEVAAVEPLY QALNEGRCSK RIHKGLVVRQ GLLGKLPAMP VRWRVRSIMG LMAVMLALAL
FGTDASWQAL LLGALAMLAG TALLEWQIVR PIENVATQAL KVATGERNSV QHLNRSDELG
LTLRAVGQLG LMCRWLINDV SSQVSSVRNG SERLAKGNND LNEHTRQTVE NVQETVTTMN
QMAESVKLNS ETASAADKLS MAASSAATQG GEAMDTVIKT MDDIAHSTQR IGTITTLIND
IAFQTNILAL NAAVEAARAG EQGKGFAVVA GEVRHLASRS ANAANDIRKL IDASATKVQS
GSEQVHAAGR TMDDIVAQVQ NVTLLIARIS QSTQEQTDGL SSLTRAVDEL NRITQKNAAL
VEESAQVSAM VKHRASRLED AVTVLH
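
The protein sequence can be reproduced from the gene sequence using translation table 11, as listed in the record. The sketch below builds the codon table from the standard NCBI layout (bases ordered T, C, A, G). For table 11 the amino acid assignments are the same as in the standard code, and the differences concern alternative start codons. Because this gene begins with ATG, no special start-codon handling is included, which is a simplification. The `translate` function is a hypothetical helper.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon.

    Spaces and line breaks are ignored. Any codon containing a character
    other than A, C, G or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence from the record.
print(translate("ATGTCTTCTC ATCCCTACGT C"))  # MSSHPYV
```

The output matches the first seven residues of the protein sequence. Translating the full gene sequence in the same way allows a complete comparison.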