Gene Csal_1969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1969 
Symbol 
ID4027209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2224457 
End bp2226076 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content61% 
IMG OID637967165 
Productflagellar hook-associated protein 
Protein accessionYP_574020 
Protein GI92114092 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATTT TCTCGATCGG CCTGAGTGGC CTGAGCGCGG CCCAATCAGC GTTGTCGACG 
ACCAGTAACA ACCTGAGCAA CGTGTATACC GAAGGGTATA ACCGTCAGTT GACGATCCTG
GGAGAAAGCT ACAGCAACGG CTCGACGGGC ACGGGGGTGT CGGTCGACGA AGTGCAGCGC
CAGTTCAACA GTTACGTCAC CGAGCAGTAC AACGACGCCA ACAGCCAGAA AAGCGCGCTG
GAGAGTTATC AGAGCCAGGT CTCGCAGATC GACGATCTGC TCGCCGACAG CGACGCCGGG
CTCGCACCGC TGGTGCAGGA CTTCTTCTCC AGCCTCGACG ATCTCACCGG TGCGGCATCC
GACCCGGCGG CCCGCCAGGG CGTGCTGGGC TCGGCATCGT CCATGGCGGC ACAGTTCCGT
TCCTTCGACT CCTATCTCGG CGACATGCGC AGCAGCCTCA ATGGCCAGCT CGAAAGCACG
GTGACGGATA TCAACAACAC CGCCACCCAG ATCGCCAAGC TCAACGATCA GATCACCCAG
GCGCGGGCCA AGTCCGGCGA GGAACCCAAC TCGCTGCTCG ACAAGCGCGA CAAGCTGGTG
TCCGACCTCA ATGAGCTGGT GGGTGCGGAC CTGAATATCC AGGATGGCGA CACCTACAAC
ATCAATCTGG AAAACGGCCA GCCGCTGGTC TCGGGCACCC AGAGCTACGA TCTCGAGGCG
GTGGAGTCCA ACGAGGATCC GTCGCGCACC GTGATCGCCT ATCGGGATGC GGCGGGCAAC
GTCAACCAGA TCGATGACAG TGCCTTCGAG AACGGCGAAC TGGGCGGTCT GCTGTCGTTT
CGCAGCGAAA CGCTGGACAG CGTGCAGAAC CAGCTCGGCC GCATGACGGT AGTACTCGGC
TCGGAGTTCA ACGCCGTGCA CGAGAACGGT GTCGACCTCA ATGGCGACGC CGGCGAGGCC
TTTTTCGAGA TCGGTTCGCC GCGGGTCATG AGCAACAGTC AGAACGACGG CGACGGTGCC
ATCAGCGCCG AGTACAGCGA CGTGAATCAG CTGACCTCGA GCGATTATCG AATCACCTAC
ACGGGTGGCG AGTACCAAGC CGAGCGTCTC TCGGATGGCG ACACGACGAC CCTGACGCCG
GATGGTAGTG GTGATGTCGA TCTTGACGGC ATGACCTTGA ATATTTCCGG TACCGCCCAG
GAAGGCGACA CCTTCATGCT GCAGCCCACG CGTACGGCCT CGCAGGGTTT CGAGGTGGCG
ATCACCGAGG GGGCGGAAAT CGCCGCGGCG AGCAGCGGGG GCGGTAGCGG CAACAACGAG
AATGCGCTGG ATCTGCTCGA CCTGCAGAGC GAGAAGATGG TCGGCGGAAC CTCTTCGATC
AGCGATGCCT ACGCATCGCT GGTCAATGAA GTCGGTAACA CCACCAACAT CACCCAGGTC
AATCTGGATG CGCAGTCGGG GCTGACCGAG CAGCTGCGCG AGTATCAGCA GTCCGAATCG
GGCGTCAACC TGGACGAGGA ATACGCCAAC CTGGTCCGTT ACCAGCAGTA TTACCAGGCC
AATGCGCGCG TCATCGACGT GGGCTCCCAG GTGCTCGATT CGGTCCTGCA GTTGCGTTGA
 
Protein sequence
MSIFSIGLSG LSAAQSALST TSNNLSNVYT EGYNRQLTIL GESYSNGSTG TGVSVDEVQR 
QFNSYVTEQY NDANSQKSAL ESYQSQVSQI DDLLADSDAG LAPLVQDFFS SLDDLTGAAS
DPAARQGVLG SASSMAAQFR SFDSYLGDMR SSLNGQLEST VTDINNTATQ IAKLNDQITQ
ARAKSGEEPN SLLDKRDKLV SDLNELVGAD LNIQDGDTYN INLENGQPLV SGTQSYDLEA
VESNEDPSRT VIAYRDAAGN VNQIDDSAFE NGELGGLLSF RSETLDSVQN QLGRMTVVLG
SEFNAVHENG VDLNGDAGEA FFEIGSPRVM SNSQNDGDGA ISAEYSDVNQ LTSSDYRITY
TGGEYQAERL SDGDTTTLTP DGSGDVDLDG MTLNISGTAQ EGDTFMLQPT RTASQGFEVA
ITEGAEIAAA SSGGGSGNNE NALDLLDLQS EKMVGGTSSI SDAYASLVNE VGNTTNITQV
NLDAQSGLTE QLREYQQSES GVNLDEEYAN LVRYQQYYQA NARVIDVGSQ VLDSVLQLR