Gene Rcas_3076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3076 
Symbol 
ID5540572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3983275 
End bp3985125 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content58% 
IMG OID640895195 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001433148 
Protein GI156743019 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.315862 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTCA CGCTGACCCC CTACAACAGC CTCCTGATAG TGCTTAGTTT CATTGCAATC 
GCAATTGCCG GCTATGCGTT GAAAAAACGG GCGATGCCGG GCGCATTGCC TTTGAGTCTC
CTCGCCTGTG CCATGTTCGT TTGGTTGGTG AGTTATGCGA TGGAACTGAG CAGTCAGACG
CTCTCCGTCG CGCTCTGGTG GGTCCGGCTT GAGTTTGCCG GCATTGTAGC GGTTCCAGTA
GCGTGGCTGT GGTTCGCGGT TGAGTATACC GGTGCGTGGT CCGGGATCAA TCGTCGCCGG
GTGTGGCTGC TGGGCATTGT GCCGCTTATC ACCATGATCG TCATTCTGAC GAATGAGTAT
CACCGGCAGT TCTGGCAGTC GGTGGTTCTG ATTACGGACG GTCCTTTCAC GGTCTTCGAT
TCGAGGCATG GCTTCTGGTT TTGGGTGCAT ACGGCCTATT CGTATCTGTG TCTGCTTTGT
GGAACCGGAC TTATTGTCCG TTTCATCCGT CGGACGCCGG GTTTGTTCCG TGGTCAGATC
GGCGCCATGC TGATAGCGGT TGCTGCTCCC TGGATTGGCA ATATCATCTA TCTTGCAGGG
CTGAGTCCCT GGGGAAAACT CGATCTGACC CCATTTGCGT TAACGGTGTC GCTCATTGCT
ATTGCCTGGA GCGTGTTCAG TTTTCGTCTC CTCGAAATCC GCCCGATTGC GCGTGATATG
GTGTTGCAGA GCATGAGCGA CGGTGTGATC GCAGTCGATG AGCAGGGGCG GATCATTGAG
GTCAATCGCG CCGCACAAAC AATGATAGGA TTACCCGCAT CGCAGATCAT CGGCAAACAT
GCGCGCGAGG TGATCGTGCA GTGGCCAGAG GTTGTTGCGC GCTACCGCGA TATGGTCGAG
GTGGCGGAAG AGATCGAGGT CAAGGTCGGT GCGGAGCGTC GCTGGTTCAA TGTGCAGATT
TCGCCGATCT ATGATGCGCG GCGGGCGTAT CGTGGGCGCC TGTTCGTCTG GCGCGACGTC
ACCGGGGAGC GCTTGATCCG TGAAGAACTG CGACGCAACA ATGAGCGCCT GCTGGAGGCG
CAACAGGCAT TGACCTCAGC GCTGCGAGCG GCGGAGGAAG GCAACCGTGT CAAGAGCGTC
TTTCTGGCGC ATATGAGCCA CGAGATTCGC ACGCCGCTCA CTGCGATCAT GGGGTATTGC
CAGTTGCTGG AAGCGGGGAT CGAGCGCCAG AACCTGGCGC AAACGCGTGC TGATCTGGAG
GCGATTCGGG TGGCGTCCGG TCATCTGTTG AGCCTTGCCA ATAATGTGCT GGAGATGGCA
TGGATCGAAG CCGGTCGATC CGAGGTGCAT GATATTGCGT TTGAGGTGAT GGATGTGGTG
CTGGATGTGA TCGCCACGGT GCAGCCGTTG ATCAGACGCA ATCGGAATCA CTTGCGGGTC
GAAGGGGCGG AACATGCGGG AGTCGTCCAG GGTGACCCGG CGAAGGTGCG CCAGATTCTG
CTCAACCTGG TGAGTAACGC TGCCAAGTTC ACGACTGGCG GCGAAGTAGC GGTGCGAGTT
GCGCACATTG GCGATGCATT GCAGCCGCGC ATCCAGTTTC AGGTGTCCGA CACAGGATCG
GGGATTGCGC CAGATCGGAT TGAGCGGTTG TTTATGCCGT TTGCGATTGC GGAAGAGCAC
GTGGGACGCG ATCAACGCGG CGTTGGGCTA GGGTTGGCGA TCAGCCGGTA TTATTGCCGG
TTGATGGGCG GCGATCTCTC CATCGAGAGC GCGCTGGGGC GCGGCACGAC GGTGACGTTT
TGGATTCCCG CGCGTGTGCC GCAGGGTGCG CTGGCGCGAG TGAAGTGTTA G
 
Protein sequence
MNVTLTPYNS LLIVLSFIAI AIAGYALKKR AMPGALPLSL LACAMFVWLV SYAMELSSQT 
LSVALWWVRL EFAGIVAVPV AWLWFAVEYT GAWSGINRRR VWLLGIVPLI TMIVILTNEY
HRQFWQSVVL ITDGPFTVFD SRHGFWFWVH TAYSYLCLLC GTGLIVRFIR RTPGLFRGQI
GAMLIAVAAP WIGNIIYLAG LSPWGKLDLT PFALTVSLIA IAWSVFSFRL LEIRPIARDM
VLQSMSDGVI AVDEQGRIIE VNRAAQTMIG LPASQIIGKH AREVIVQWPE VVARYRDMVE
VAEEIEVKVG AERRWFNVQI SPIYDARRAY RGRLFVWRDV TGERLIREEL RRNNERLLEA
QQALTSALRA AEEGNRVKSV FLAHMSHEIR TPLTAIMGYC QLLEAGIERQ NLAQTRADLE
AIRVASGHLL SLANNVLEMA WIEAGRSEVH DIAFEVMDVV LDVIATVQPL IRRNRNHLRV
EGAEHAGVVQ GDPAKVRQIL LNLVSNAAKF TTGGEVAVRV AHIGDALQPR IQFQVSDTGS
GIAPDRIERL FMPFAIAEEH VGRDQRGVGL GLAISRYYCR LMGGDLSIES ALGRGTTVTF
WIPARVPQGA LARVKC