Gene Information |
Locus tag | Csal_1989 |
Symbol | |
ID | 4027073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2246371 |
End bp | 2248041 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637967184 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_574039 |
Protein GI | 92114111 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
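
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2246371
end_bp = 2248041
reported_gene_length = 1671      # bp
reported_protein_length = 556    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming a terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


computed_gene = gene_length(start_bp, end_bp)
computed_protein = protein_length(computed_gene)
print(computed_gene, computed_protein)  # 1671 556
```

Under these assumptions, both computed values agree with the reported Gene Length and Protein Length.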
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00286784 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCGAATA TTCTAAATAA AAAACAGCTG AATAAAATTA GCGTCAAGTA TGCGGCCGCT TTTATTGCGG TGGCATTGTC GATGTTGACG ATCGTGGTGG TGGACGAGCG ATTGGTCAAT CTCGTCAAGA AGCGGATGAC CGCCTTCAGT GGTGAATTCA ATGTGGCGAT TTCCAATGTC CTGAATGCGG ATCGAGACCT GTATCAGGGG CAAATGTCGG TACTGGAGCT TCTAGATGAA GAGCCAGAAA GCGACGAAAG TCAGCAATTC ACAAAAGCTT ATCAGGAAAA CGCCGAGCAG GCTTATGATC GCATGCAGTC ATTCGCGGCA CTCATGTCGA AATATCCGGA CGTAGTAACG GAGCTCGAAG AATTCGAAGC ACGCTTCGAC ACCTGGGTAG GTGTATCCGA TGATATCGTC GAGTTGCACC GAGGTGGCGA AGTAGAGAAA GCGCAAACAT TATTCGATGA AGAGAGCACG CGTCGCTTCA ATGAGCTGCG CGAAATTTAC GATCTTGCTG GCCAGGCCGT GAATGCAAAA GTGACCGAAC TCGAAGCATC GATGCTTGAC CGGATCAATA CCCAGCAGAC GTGGGTCATC GCTATTTCGG CGGTCATCTT CATGGCTGCC ATCGCGCTCG CGTTGATCGG CCCACGTAGA ATGTCGAAAG CGATTCGGGA CGTCAGTGAG CGAATCCGCG ATATTGCCGA GGAAGACGGT GACCTGACTC AGCGTATTGA ATCCCGCCGT AGTGACGAAA TCGGCGAGCT GAGTGAACGC TTCAATGGCT TCATCGACCG GATAGATGGA ACGTTGCAAG CGGTTCGCTC AAGTACGCTT AGTGTCAACT CGGCCTCGGA CGAAATCGCC AAGAGCAGTC AGGACCTTGC GTCACGTACC GAGCAGACGG CATCCAACCT GCAGGAAACC TCGTCGTCGA TGGAAGAAAT TACCGCGACG GTTCGTCACA CGGCCGAGAC GGCCACACAA GCCGATGAAC TATCACGCCG CAGCGTGGAG GTGACTCGTG CCGGTCAGAC TGCGATGCAA GACGTCGAAA CGACCATGAA AGATATCGGC GAGTCCGCAT CACGCATCAA CGAGATCATC GGCATGATCG ATTCGATTGC CTTTCAGACC AACATCTTGG CGCTGAATGC GTCCGTCGAG GCGGCACGAG CCGGTGAGCA TGGCAAGGGC TTCGCCGTCG TCGCCGAGGA GGTCCGCACG CTGGCCGGGC GCTCGAGCGA TGCCTCACGT GAAATTCGCG AACTGGTCAA CAACTCCGTC GCCAATGCCG AGTCAGGTTC GGAGAAAGTG CGTAAAGCCG GCCAGACGAT GGAAGACATT GCCGGTAGCA TCGAGCGAGT TACGCAACTG ATCGGTGAAA TCAGCACCGG CTCCCAGGAA CAGAGCAGCG GCATCAGCCA GGTCAACACG GCCGTGACCG AACTGGATAC CATGACGCAA CAGAATGCGG CGATGGTTGA GCAAAGCAGT GCCGCCGCCG ATGAAATGAG CCATCAAGCG GAACGTCTCA TGGCCCTGAT CGACTCCTTC AAGTTGAGCC ATACCGAAGA AGGAGAATCA CCAGTGCGTC AGGCCCTCGT ACCTAGACAG GGTGAAGCAT CGTACGCCGG TAAAAGACAG GCCCCGCTGC TGGCTGGCTG A
Protein sequence | MSNILNKKQL NKISVKYAAA FIAVALSMLT IVVVDERLVN LVKKRMTAFS GEFNVAISNV LNADRDLYQG QMSVLELLDE EPESDESQQF TKAYQENAEQ AYDRMQSFAA LMSKYPDVVT ELEEFEARFD TWVGVSDDIV ELHRGGEVEK AQTLFDEEST RRFNELREIY DLAGQAVNAK VTELEASMLD RINTQQTWVI AISAVIFMAA IALALIGPRR MSKAIRDVSE RIRDIAEEDG DLTQRIESRR SDEIGELSER FNGFIDRIDG TLQAVRSSTL SVNSASDEIA KSSQDLASRT EQTASNLQET SSSMEEITAT VRHTAETATQ ADELSRRSVE VTRAGQTAMQ DVETTMKDIG ESASRINEII GMIDSIAFQT NILALNASVE AARAGEHGKG FAVVAEEVRT LAGRSSDASR EIRELVNNSV ANAESGSEKV RKAGQTMEDI AGSIERVTQL IGEISTGSQE QSSGISQVNT AVTELDTMTQ QNAAMVEQSS AAADEMSHQA ERLMALIDSF KLSHTEEGES PVRQALVPRQ GEASYAGKRQ APLLAG
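
The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch, not part of the original record, of how such a display string might be normalized and its GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the source does not state how the reported GC content value was derived, so this is only one plausible way to compute it.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here only as a short demonstration.
first_blocks = "ATGTCGAATA TTCTAAATAA"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 20.0
```

Passing the full Gene sequence field to `gc_percent` would give a value that can be compared against the reported GC content.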