Gene Csal_2019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2019 
Symbol 
ID4027103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2280856 
End bp2282559 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content63% 
IMG OID637967214 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_574069 
Protein GI92114141 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAA TGCTGAACGA CCTTTCGGTC AGGGCCAGCT TGACGATCGC ATTGACCGTC 
ATGGTCGTCA TGTGCGCGAT CATCAGCGCC ATGGGATTCT ATTCGAACCA GAAAAGCGCC
GAGGCGATGG AGACCATCGG CACCATCGGG TTCGAGCAGA CCAACACCAT CAACCGCGCC
ACCGTGAACC TGATACGTGC GCGCGGTCTG CTGGCCAGCT ACCGAAATGC GGTGGAAGCC
GGGGATACCG AGCGAGCGCA AGACTTGCAG ACAGCGGTTG CCAACGCCGT GGGCCAGGCA
AGCGATCGTA TCGATCAGTT CGCTCAGGTG ACCAAGACGG ATGCCGGACA AGAATACGCG
GCACGCATCG ACGAGGCTTT TGTCGCCTTG CGCGACGAAA TCGAACGTGA AATGAACGCC
GGCGAGGATG CCGTGCGTAC CGAGGACGAT CAGCGCATCA ACCCGCTGAT GGACGACCTC
GACGATAGCG TGCGCGATTT CATCAAGTAT GCCGAGGGGC GTGTCGGCGA CGCCATCGTG
GCGGATGCCG GCAACAGCCA GCTGATGGAA ATCCTGTCCA TCGTGCTGCT GGTTCTCGCC
ATCATCGTGG CGATTCTGGT GCGTCTGGTG CTCGTCAAGT CCGTGGTCAA GCCGCTGGAT
GAAGCGGTCG AGCATTGCGA ACACATCGCC AAGGGCGACC TCTCGCACCA TGTCGACGAA
CGTGGCAAGA ACGAAATCGG CCGCCTGTTC AACGCCATGC GCGACATGCA GCAAGGCCTG
GTGGGGACGG TCACCTCCGT GCGCGAGGCC AGCGGCTCGA TTCATGGCGG GGCCCGCGAG
ATTGCCTCCG GGAACGCCGA TTTGTCCTCG CGCACCGAGC AGCAGGCCGC CTCGCTCGAG
GAAACCGCGT CGAGTATGGA AGAGCTGACC TCCACCGTGC GCCAGAACGC CGACAATGCG
CGTCAGGCGA GTTCCCTGGC CAATGATGCC TCGACCACGG CCGGACGCGG CGGCGATGTC
ATGCAGGAGG TCTCCACGAC CATGCAGGGC ATCACCGAAA GCTCCAAGCA GATCTCCGAT
ATCATCGGCA TGATCGATTC GATCGCCTTC CAGACCAACA TTCTGGCGCT CAACGCCTCG
GTCGAGGCGG CGCGTGCCGG TGAACAGGGC CGCGGGTTCG CCGTGGTCGC CAGCGAGGTG
CGCAACCTGG CCAGCCGCAG TGCCGAAGCC GCCAAGGAAA TCAAGGGGCT GATCCACACC
TCGACCACAC AGATCGAGCA GGGCTCCGAG CTGGTCGGCA ATGCCGAAAC CACCATGCGC
GACGTGGTGC AGGCCGTGAA GCGCGTCAGC GACATCATGG ATGAAATTTC CGCGGCGTCG
CAGGAGCAGA GCGACGGCAT CGAGCAGGTG AGCCAGGCCG TGACCCAGAT GGACCAGGTG
ACCCAGCAGA ATGCCTCCCT GGTTCAGGAA GCCTCCAGCG CCTCCGCGTC GCTCGAGGAA
CAGGCGCAGC GCCTGGAAGA CGTGGTGTCC ACGTTCCGTC TGCCGGGCGG CAGCACGCGT
CAGTTGTCGC GTGCCAATAC GTCGCCCGGC AAGTCCGCCG GATCGTCCGC GACGCCATCG
ACCTCTGCGC AGGGTGCCTC TCAGCGTGTG CCGGCCAAGC GCGCGCCGGT CACCCAAGAA
GAGGACGAGT GGGAAGAATT CTAA
 
Protein sequence
MGKMLNDLSV RASLTIALTV MVVMCAIISA MGFYSNQKSA EAMETIGTIG FEQTNTINRA 
TVNLIRARGL LASYRNAVEA GDTERAQDLQ TAVANAVGQA SDRIDQFAQV TKTDAGQEYA
ARIDEAFVAL RDEIEREMNA GEDAVRTEDD QRINPLMDDL DDSVRDFIKY AEGRVGDAIV
ADAGNSQLME ILSIVLLVLA IIVAILVRLV LVKSVVKPLD EAVEHCEHIA KGDLSHHVDE
RGKNEIGRLF NAMRDMQQGL VGTVTSVREA SGSIHGGARE IASGNADLSS RTEQQAASLE
ETASSMEELT STVRQNADNA RQASSLANDA STTAGRGGDV MQEVSTTMQG ITESSKQISD
IIGMIDSIAF QTNILALNAS VEAARAGEQG RGFAVVASEV RNLASRSAEA AKEIKGLIHT
STTQIEQGSE LVGNAETTMR DVVQAVKRVS DIMDEISAAS QEQSDGIEQV SQAVTQMDQV
TQQNASLVQE ASSASASLEE QAQRLEDVVS TFRLPGGSTR QLSRANTSPG KSAGSSATPS
TSAQGASQRV PAKRAPVTQE EDEWEEF