Gene Csal_1989 details

Gene Information

Locus tag: Csal_1989
Symbol:
ID: 4027073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2246371
End bp: 2248041
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 55%
IMG OID: 637967184
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_574039
Protein GI: 92114111
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
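
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2246371
end_bp = 2248041
gene_length_bp = 1671
protein_length_aa = 556

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under those two assumptions, all three checks hold for the values listed in this record.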


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00286784
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGAATA TTCTAAATAA AAAACAGCTG AATAAAATTA GCGTCAAGTA TGCGGCCGCT 
TTTATTGCGG TGGCATTGTC GATGTTGACG ATCGTGGTGG TGGACGAGCG ATTGGTCAAT
CTCGTCAAGA AGCGGATGAC CGCCTTCAGT GGTGAATTCA ATGTGGCGAT TTCCAATGTC
CTGAATGCGG ATCGAGACCT GTATCAGGGG CAAATGTCGG TACTGGAGCT TCTAGATGAA
GAGCCAGAAA GCGACGAAAG TCAGCAATTC ACAAAAGCTT ATCAGGAAAA CGCCGAGCAG
GCTTATGATC GCATGCAGTC ATTCGCGGCA CTCATGTCGA AATATCCGGA CGTAGTAACG
GAGCTCGAAG AATTCGAAGC ACGCTTCGAC ACCTGGGTAG GTGTATCCGA TGATATCGTC
GAGTTGCACC GAGGTGGCGA AGTAGAGAAA GCGCAAACAT TATTCGATGA AGAGAGCACG
CGTCGCTTCA ATGAGCTGCG CGAAATTTAC GATCTTGCTG GCCAGGCCGT GAATGCAAAA
GTGACCGAAC TCGAAGCATC GATGCTTGAC CGGATCAATA CCCAGCAGAC GTGGGTCATC
GCTATTTCGG CGGTCATCTT CATGGCTGCC ATCGCGCTCG CGTTGATCGG CCCACGTAGA
ATGTCGAAAG CGATTCGGGA CGTCAGTGAG CGAATCCGCG ATATTGCCGA GGAAGACGGT
GACCTGACTC AGCGTATTGA ATCCCGCCGT AGTGACGAAA TCGGCGAGCT GAGTGAACGC
TTCAATGGCT TCATCGACCG GATAGATGGA ACGTTGCAAG CGGTTCGCTC AAGTACGCTT
AGTGTCAACT CGGCCTCGGA CGAAATCGCC AAGAGCAGTC AGGACCTTGC GTCACGTACC
GAGCAGACGG CATCCAACCT GCAGGAAACC TCGTCGTCGA TGGAAGAAAT TACCGCGACG
GTTCGTCACA CGGCCGAGAC GGCCACACAA GCCGATGAAC TATCACGCCG CAGCGTGGAG
GTGACTCGTG CCGGTCAGAC TGCGATGCAA GACGTCGAAA CGACCATGAA AGATATCGGC
GAGTCCGCAT CACGCATCAA CGAGATCATC GGCATGATCG ATTCGATTGC CTTTCAGACC
AACATCTTGG CGCTGAATGC GTCCGTCGAG GCGGCACGAG CCGGTGAGCA TGGCAAGGGC
TTCGCCGTCG TCGCCGAGGA GGTCCGCACG CTGGCCGGGC GCTCGAGCGA TGCCTCACGT
GAAATTCGCG AACTGGTCAA CAACTCCGTC GCCAATGCCG AGTCAGGTTC GGAGAAAGTG
CGTAAAGCCG GCCAGACGAT GGAAGACATT GCCGGTAGCA TCGAGCGAGT TACGCAACTG
ATCGGTGAAA TCAGCACCGG CTCCCAGGAA CAGAGCAGCG GCATCAGCCA GGTCAACACG
GCCGTGACCG AACTGGATAC CATGACGCAA CAGAATGCGG CGATGGTTGA GCAAAGCAGT
GCCGCCGCCG ATGAAATGAG CCATCAAGCG GAACGTCTCA TGGCCCTGAT CGACTCCTTC
AAGTTGAGCC ATACCGAAGA AGGAGAATCA CCAGTGCGTC AGGCCCTCGT ACCTAGACAG
GGTGAAGCAT CGTACGCCGG TAAAAGACAG GCCCCGCTGC TGGCTGGCTG A
 
Protein sequence
MSNILNKKQL NKISVKYAAA FIAVALSMLT IVVVDERLVN LVKKRMTAFS GEFNVAISNV 
LNADRDLYQG QMSVLELLDE EPESDESQQF TKAYQENAEQ AYDRMQSFAA LMSKYPDVVT
ELEEFEARFD TWVGVSDDIV ELHRGGEVEK AQTLFDEEST RRFNELREIY DLAGQAVNAK
VTELEASMLD RINTQQTWVI AISAVIFMAA IALALIGPRR MSKAIRDVSE RIRDIAEEDG
DLTQRIESRR SDEIGELSER FNGFIDRIDG TLQAVRSSTL SVNSASDEIA KSSQDLASRT
EQTASNLQET SSSMEEITAT VRHTAETATQ ADELSRRSVE VTRAGQTAMQ DVETTMKDIG
ESASRINEII GMIDSIAFQT NILALNASVE AARAGEHGKG FAVVAEEVRT LAGRSSDASR
EIRELVNNSV ANAESGSEKV RKAGQTMEDI AGSIERVTQL IGEISTGSQE QSSGISQVNT
AVTELDTMTQ QNAAMVEQSS AAADEMSHQA ERLMALIDSF KLSHTEEGES PVRQALVPRQ
GEASYAGKRQ APLLAG
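
The sequences above are listed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step. The helper names `normalize` and `gc_fraction` are hypothetical and not part of any database tool, and only the first line of the gene sequence is used as sample input.

```python
def normalize(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as listed in this record.
first_line = "ATGTCGAATA TTCTAAATAA AAAACAGCTG AATAAAATTA GCGTCAAGTA TGCGGCCGCT"
seq = normalize(first_line)

print(len(seq))   # number of bases on the first line
print(seq[:3])    # first codon
print(round(gc_fraction(seq), 3))
```

Applying `normalize` to the full gene sequence block and comparing `len()` of the result with the Gene Length field is one way to confirm that no characters were lost in copying. The GC fraction of a single line is only a local value and need not match the GC content reported for the whole gene.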