Gene Rcas_4301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4301 
Symbol 
ID5541812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5548521 
End bp5550260 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content58% 
IMG OID640896407 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001434345 
Protein GI156744216 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATCTAT CGATCCGCAC CAAATTGCTG GCAGCGCTCG GCGTTGACCT GATCCTCATG 
CTGGTGCTGG GCAGTTTTGC CCTGCACCAG ATGAGCATTA TGAACCAGAA AGCCGATTTT
GTCGTCAACC AGACGATCCT TTCGATTGAT CTGGTCAACG CAATGAACGA TGTGCTCCTG
AATTACCGTA CCCGACAGAT GGAGTACATC CTCAACGCCG CCCCTGCCGA CAAACAGCGC
ATCGAGAAGG AACTGCTGGA CCTCGAAACG CGCATGGACG GCATTTTCCG CAATTACAGC
GCCAACTATC AACCGGATGC GACGGAACGC CTGATCTTTG AACAGACGCA GCAGGACTGG
CAGCGCTACG TGTTTTTGAC ACACACCCAG TTTCTGCCGG CGAATCGCAA CAGCAACACC
GGGAATGTGC ATCCATCGTT TGGGCGGTTG TCACCGCTGT ATGGCAGCCT GCAAACGAAC
ATGCAGAAGA TCAGGGCGCA GAGTCAGGCG CGCGCCGAGG CGGCGCGCGC AAGCGTCGAA
ACGGCGTATT CCACCTCACG CTTTGTGATT GTGAGCGAAA CTATTCTGAC CGTATTCGTC
TCGGCGGTGA TTGGACTGAC CCTCTCCGGC AATATTGCCC GCCGCATTCG CACGCTGCGC
GATGCAACCA TTGCCGTTTC CGGCGGCGAT CTGAGCCGGC AGGTGTCGCT GCGCGGCGGC
GACGAACTGG TGTTGCTGGC GAACAATTTC AACCTGATGG TCGCCAGCCT GCGGCAGCAA
CGCATGCTGC TCGAAGAGCG CAATGCCGAA CTCTCAGCGA GCCTGGAGAC GCAACAACGG
TTGATGGAAG ACCTGGTGCA GCGCAAACAG GCGGAGGAAG CGGCGCATCG CGCGCAGGCG
GCGGCGGAAG CAGCCAGCCA CGCGAAGAGC ATGTTCCTGG CGACGATGAG CCACGAACTG
CGCACGCCGC TGAACGCGAT CCTGGGGTAT GTTCAGTTAT TGCACCTCGA AGCGCAAATC
CATGGACGAT CCGAGATGCT CCCCGATCTG GAGCGCATCC GTTCGGCGGG CAAGCATCTG
CTTACCATCA TCAGCAATAT TCTCGACTTC TCGAAGATTG AGCAGGGCCG GATGAATGTC
GAGATCGACA CCTTCAATGT GAGCGTGATT GCGCACGAAA TGATCAGCAT TATCGAACCG
CTGGCGCGCA ATCGCAACAA CACGCTGACC CTCACCTGCC CCCCAGACAT CGGTATGATG
CAGTCCGATG CGGGCAAAGT GCGTCAAATT CTCTTCAACC TGTTGAGCAA CGCGGTTAAG
TTTACCGATA ACGGCACGGT GGCGCTGACT ATCGAACGTG AATGTTGTTC TGACGGCGAT
TGGGTGCGCT TCAGCGTCGC TGACACTGGC ATTGGCATGT CGCCAGAACA ACTGACGCGT
CTGTTCCAAC CGTTCACACA GGTGCATCAG AGCCACTCGT CGCACGCACA TCGCGGCACG
GGCCTTGGGC TGGCGCTCAG TCAACAGTTA TGTCGCCTGC TCGGCGGCGA CATTTCGGTC
ACCAGCGAGG TCGGCAGAGG ATCGGTCTTC ACTGTGCGTT TGCCAGCAGT CATCAGCACT
GCCCATACCG CCGATGTACG TCTCGATTTT GCGCAACACA TACGAAGCGC GACTCGCCAC
GACGTGGATC ATGCCACTAC GGCGCCATCA CCGTACACAA CGACAACGCT CAGCGCGTAG
 
Protein sequence
MNLSIRTKLL AALGVDLILM LVLGSFALHQ MSIMNQKADF VVNQTILSID LVNAMNDVLL 
NYRTRQMEYI LNAAPADKQR IEKELLDLET RMDGIFRNYS ANYQPDATER LIFEQTQQDW
QRYVFLTHTQ FLPANRNSNT GNVHPSFGRL SPLYGSLQTN MQKIRAQSQA RAEAARASVE
TAYSTSRFVI VSETILTVFV SAVIGLTLSG NIARRIRTLR DATIAVSGGD LSRQVSLRGG
DELVLLANNF NLMVASLRQQ RMLLEERNAE LSASLETQQR LMEDLVQRKQ AEEAAHRAQA
AAEAASHAKS MFLATMSHEL RTPLNAILGY VQLLHLEAQI HGRSEMLPDL ERIRSAGKHL
LTIISNILDF SKIEQGRMNV EIDTFNVSVI AHEMISIIEP LARNRNNTLT LTCPPDIGMM
QSDAGKVRQI LFNLLSNAVK FTDNGTVALT IERECCSDGD WVRFSVADTG IGMSPEQLTR
LFQPFTQVHQ SHSSHAHRGT GLGLALSQQL CRLLGGDISV TSEVGRGSVF TVRLPAVIST
AHTADVRLDF AQHIRSATRH DVDHATTAPS PYTTTTLSA