Gene Rcas_4209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4209 
Symbol 
ID5541720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5444002 
End bp5445510 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content61% 
IMG OID640896316 
ProductTPR repeat-containing CheR-type MCP methyltransferase 
Protein accessionYP_001434254 
Protein GI156744125 
COG category[N] Cell motility
[T] Signal transduction mechanisms
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1352] Methylase of chemotaxis methyl-accepting proteins
[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0700485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACGA ATGGTATGAA TATGGCGCAC GATCTCTTCG TTCCTCCGCC AGTGCGTCTG 
TCGCCGGAGG CGTTCGACCG GCTACGCACC CTGCTGGCAG ATTATAGCGG CGTCTACCTG
GATACGGCGC AGCAGCGCGT GCTGGAAGCG GGTCTGGCGC AGCGTGTGGC AGCGCTTGGC
GAGACCCTCG AGTCCTATGA GCGCCACATC AGCGCACCGG CCGGTCGCAA CGAACTCCAC
CGTCTGGCAG AGATGGTGGT CAACCACGAG ACCTTTTTCT TTCGCAATGC ACCGCAGATG
CGCGCACTGC GCGAGACATT GCTCTTTGAA TTGCACCGTC GCAAGCCGCC GGGCGAGCCG
ATCCGCATCT GGAGCGCCGG TTGCGCAACC GGCGAAGAGG CGTATTCCCT GGCGATCACG
GTGCTGGAAA CGTTCGGTCT GGCACTGATA CGTCCGGTTG AAATCTGGGC AACCGATCTG
AGCGAACTGG CGCTCGAAAA GGCGCGGACC GGATTCTACC GGGGGCGCTC GCTCAACAAT
GTGACGCCAA TGCTGCTCAA TCGCTACTTC GTGCGGCACG GCGACGGATT TCTCGTGTCG
GACGCTGTGC GGGCGCTGGT GCGTTTCGAG CAACTGAATC TCCTCGAAAC GTTTCCGCCG
ACGGCGTATG GCGTCGACGC AATTTTCTGC CAGAATGTGA CGATCTATTT TCGTCCGGAG
ACGCGGCGTT CATTGATCGA ACGATTTCAT CGCTGCCTGC CGGTCCACGG GCTGCTCTTC
CTGGGATTTT CAGAAACGTT GTGGAATGTG TTCGATGGTT TTCGTTCACG CGAAGTATCG
GGGGCGTATG TCTACCAGAA GGTCAATCCG CCCGACCGAC CGACTCAGCA TCGCAGCACT
TCACCTCGAC CATCGCTACC AACAGAGACC CGACGACGTT CACCGTCCGT CGTCAAAGTC
GCATCCACCC CTTCTCGAAA GGCGCGACCG CCTGTTGCTG CGACAACCGC GCCAACTATG
GAAGAGGATG TCGGTCGTGT GGAACAGGCG CAAGCCCTGA TCGACGCCGG CAGGATCGAC
GAAGCGATGG ACCTGCTGCG CAGCATTCAC CCCAACTCGT CGCTGGCGCC GCGCGCGCTG
GTGCTGGTGG CGCGCGTGCA TGCCGATCGC GGCGAACTCG ACCTGGCCAT TGCTGAAGCG
CGCCGCGCGC TCGAAATCGA TGCATTGCGC AGTGACGCCT ATCTTCTCAT CGGAACGATC
TATGCCCGCC AGGGTCAGGG AAACGAGGCG ATCCAGGCGC TCGAACGAGC GCGCTACCTG
GACCCCGACG CCGCGTTGGT TTCCTATCAC CTGGCACTGG CATACCGTCA GGCGGGCAGG
CAGGAACAGG CGATGCGCGA GTTTCGCAGT GCGCTGAGCA AACTGGCCAG GCACCGGTCC
GAGGATCTCA TCGAAGGCGT CGAAGTCGGT TGGTTGCGCA CCACATGTGA GCAACACCTG
GGCATGTAA
 
Protein sequence
MRTNGMNMAH DLFVPPPVRL SPEAFDRLRT LLADYSGVYL DTAQQRVLEA GLAQRVAALG 
ETLESYERHI SAPAGRNELH RLAEMVVNHE TFFFRNAPQM RALRETLLFE LHRRKPPGEP
IRIWSAGCAT GEEAYSLAIT VLETFGLALI RPVEIWATDL SELALEKART GFYRGRSLNN
VTPMLLNRYF VRHGDGFLVS DAVRALVRFE QLNLLETFPP TAYGVDAIFC QNVTIYFRPE
TRRSLIERFH RCLPVHGLLF LGFSETLWNV FDGFRSREVS GAYVYQKVNP PDRPTQHRST
SPRPSLPTET RRRSPSVVKV ASTPSRKARP PVAATTAPTM EEDVGRVEQA QALIDAGRID
EAMDLLRSIH PNSSLAPRAL VLVARVHADR GELDLAIAEA RRALEIDALR SDAYLLIGTI
YARQGQGNEA IQALERARYL DPDAALVSYH LALAYRQAGR QEQAMREFRS ALSKLARHRS
EDLIEGVEVG WLRTTCEQHL GM