Gene Information |
Locus tag | SeD_A3573 |
Symbol | |
ID | 6872692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3429805 |
End bp | 3431325 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642786561 |
Product | aerotaxis receptor |
Protein accession | YP_002217197 |
Protein GI | 198242077 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein [COG2202] FOG: PAS/PAC domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
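The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3429805
END_BP = 3431325
GENE_LENGTH_BP = 1521
PROTEIN_LENGTH_AA = 506


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both values should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP), GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP), PROTEIN_LENGTH_AA)
```

Under these assumptions, 3431325 - 3429805 + 1 gives 1521 bp, and 1521 / 3 - 1 gives 506 aa, matching the reported gene and protein lengths.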
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0352648 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.227962 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCTC ATCCCTACGT CAGCCAGCTA AATACCCCGC TGGATGATGA TACCACTCTG ATGTCTACGA CCGACCTGGA AAGCTATATC ACTCACGCCA ATGACACTTT TGTCCAGGTG AGCGGCTATC AGTTAAACGA GTTACTGGCG CAGCCACATA ATCTGGTGCG TCATCCGGAT ATGCCGAAAG CTGCCTTCGC AGATATGTGG TACACCCTAA AACAGGGCGA ACCGTGGAGC GGCATTGTGA AAAACCGGCG TAAAAACGGC GACCATTATT GGGTGAGGGC CAACGCGGTA CCGATGATAC GTGAAGGGCG TGTGACGGGA TATATGTCGA TCCGTACCCG CGCCACGGAT GATGAGATTG CCGCCGTCGA GCCTTTATAT CAGGCGCTAA ATGAAGGGCG GTGTAGTAAA CGTATTCATA AAGGCCTGGT GGTTCGTCAG GGCTTGCTGG GCAAACTGCC CGCTATGCCT GTTCGCTGGC GAGTGCGTAG TATTATGGGG CTAATGGCCG TAATGCTGGC GTTGGCGCTG TTCGGTACGG ATGCCTCATG GCAGGCGTTG TTGTTGGGCG CGTTGGCGAT GCTGGCAGGT ACGGCGCTAT TTGAATGGCA AATTGTGCGT CCCATTGAAA ATGTGGCGAC GCAGGCGCTG AAAGTGGCGA CCGGCGAACG CAACAGCGTA CAACACCTTA ATCGTAGCGA TGAGTTGGGG CTGATGCTGA GGGCCGTGGG GCAGCTTGGC TTGATGTGCC GCTGGCTGAT CAATGACGTA TCAAGTCAGG TTTCCAGCGT CAGAAACGGC AGTGAAAGGC TGGCGAAGGG TAACAATGAT CTGAACGAAC ACACCCGTCA GACCGTGGAG AATGTTCAGG AAACGGTAAC GACCATGAAC CAGATGGCGG AGTCCGTGAA GCTCAATTCC GAGACGGCTT CCGCTGCGGA TAAGCTTTCC ATGGCGGCCA GTAGCGCGGC GACTCAGGGA GGTGAGGCGA TGGATACGGT GATTAAAACG ATGGATGATA TCGCTCACAG TACGCAACGT ATCGGGACGA TCACCACGCT AATTAACGAT ATCGCTTTTC AGACGAATAT CCTGGCGCTG AATGCGGCGG TAGAAGCGGC GAGAGCGGGC GAGCAGGGGA AAGGGTTTGC CGTGGTTGCT GGCGAGGTAC GTCATCTTGC CAGCCGTAGC GCTAATGCGG CGAACGATAT TCGTAAATTA ATTGACGCCA GCGCAACAAA GGTGCAGTCA GGCTCCGAGC AGGTTCACGC CGCAGGCCGT ACCATGGATG ACATTGTAGC CCAGGTGCAA AATGTCACCC TGCTTATCGC ACGGATCAGC CAGTCGACGC AGGAACAGAC AGATGGGCTT TCCAGCCTGA CTCGCGCCGT GGACGAGTTG AACCGCATAA CCCAGAAGAA TGCGGCGCTG GTGGAAGAGA GCGCACAAGT CTCCGCAATG GTAAAACACC GTGCCAGCCG GCTGGAGGAT GCGGTCACGG TACTGCATTA A
|
Protein sequence | MSSHPYVSQL NTPLDDDTTL MSTTDLESYI THANDTFVQV SGYQLNELLA QPHNLVRHPD MPKAAFADMW YTLKQGEPWS GIVKNRRKNG DHYWVRANAV PMIREGRVTG YMSIRTRATD DEIAAVEPLY QALNEGRCSK RIHKGLVVRQ GLLGKLPAMP VRWRVRSIMG LMAVMLALAL FGTDASWQAL LLGALAMLAG TALFEWQIVR PIENVATQAL KVATGERNSV QHLNRSDELG LMLRAVGQLG LMCRWLINDV SSQVSSVRNG SERLAKGNND LNEHTRQTVE NVQETVTTMN QMAESVKLNS ETASAADKLS MAASSAATQG GEAMDTVIKT MDDIAHSTQR IGTITTLIND IAFQTNILAL NAAVEAARAG EQGKGFAVVA GEVRHLASRS ANAANDIRKL IDASATKVQS GSEQVHAAGR TMDDIVAQVQ NVTLLIARIS QSTQEQTDGL SSLTRAVDEL NRITQKNAAL VEESAQVSAM VKHRASRLED AVTVLH
|
| |
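The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch of how the protein sequence and GC content could be recomputed from it follows. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical; the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which start codons it permits). The whitespace stripping assumes the 10-base blocks shown in this record.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGTCTTCTC ATCCCTACGT CAGCCAGCTA"))
```

Applied to the first three blocks of the gene sequence, the sketch yields MSSHPYVSQL, which matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare with the reported GC content.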