Gene Rcas_2639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2639 
Symbol 
ID5540121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3400646 
End bp3402571 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content57% 
IMG OID640894763 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001432730 
Protein GI156742601 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATG TGACCACAAC CGGATGGCAA CCGGCGTTGC CCTACACACT GCCACTCGTG 
ATCAGCGCAT CACTGGCAGT TATCGTCGCC ATGTTTGTCT GGCAGCGCCG CACTGTGCCC
GGCGCGCGAC CACTGATTGT CTTGAGCATC GCGGCTGCGG TGTGGTCGTT TGCCTATGCG
ATGGAAATCG CCGCAACCCC CGCGCCGATT GCACTCTTTT GGGCGCGTGT GCAATATCTG
GGGATTATGA CATTGCCGGT TGCCTGGATT GCCTTCACGC TGGAATATGC CGGTCTGAAA
CCCTGGCTAA CCAGGCGAAC CATGGCGGAG ATCCTCATCG TTCCCGCACT GACACAGATT
GCCGTCTGGA CGAACGATGA CCATGGTCTG ATCTGGCCCC AGATTCGGTT GAACACCGAA
GGACCCTTTC CAATCCTCGA TTTTGATCAC GGACCGGCGT TTTGGGTCTG CAATATCTAT
GCCCATATCT GCCTGGCGAG CGGGACATTG ATCCTGCTCT GGCGATTTGT GCGGTCGCAG
CAGCTCTATC TGAGCCAGAT TGTCGTCTTC CTGATCGGGG CGCTTGCGCC CTGGATTGGA
AACGGATTAT ATGTGCTGAA CCTTACGCCG TGGACCGGGC TGGATCTGTC GCCGTTTGGG
TTCACCTTCA CCGCCGGCGC AATTGCTTTT GGCGCCGTGC GATTGCGGTT CCTTGATGTT
GTGCCCATTG CGCGCGATAT TGTGCTCGAA AGCATGAGCC AGAGCGTCAT CGTGCTCGAT
GATGACAACC GGATTGTCGA TATCAATCGC GCGGCTCAAC ACGTCATCGG CTGCACCGCC
GCCGAGGTGA TCGGCAAACC GATCCGTCAG GCTCTGAGTC GCTGGCCACA GGTTCTTGAT
CGCTACTACA ATATTGTTGA ACTGAACGAA GAGGTGCAAC TCGAAATCGA AGGGCGCCCG
CTTGTGCTGG ACGTTCTCAT CTCGCCGCTG CGTGATCGGA ATGGTCGACT CAGGGGACGA
TTGATTGTCT GGCATGATAT TACTCGCCTC AAACAGATCG AAGAGGTACT GCGTCAGCGG
AACGACGAAC TGACCGCGCT CCAGCAAACA CTGATGGTCG CCAGAGATCA GGCTGAAGCC
GCTCATCGCG CCAAGAGCGC TTTTCTGGCG CATATGAGCC ATGAAGTGCG CACGCCGCTC
AGCGCCATCC TGGGTTATAC CGATCTGATA CGCCTTGATC TGACACGCCG GGGGCAGTCC
GTATATCAGG AGGAGCTGGA GGCCATCCAC GCCTCGGCGC AGCATCTCCT GACAATGATC
AACAACATTC TGGATCTCTC GAAGATCGAC GCGGGAAGAA TGCCCTTGTA TATTGAACTC
TTTTCTATCG AAGCACTGGT TCACAATGTG ACACAGACCG CGCGCCCTCT CGCCGCGCGA
AACGGGAATA GTCTGACGGT CATCCGCGCG CCCGACGCCG ATCTTATGAC GAGCGACAAG
ACGAAGATTC GACAGGTGCT GTTGAATCTT TTGAGCAATG CCGCAAAATT CACCGAAAAC
GGCGCTATCA GCCTCCGTAT CTGGCGTGAA TTGTCTCTTT CGCCTGCGAT GAAAGCAATC
GACGACGGCG CCGACTGGAT CGTGTTTGAG ATTGCCGACA CTGGCATCGG CATTGCACCT
GAACATCTGC CGTTGCTGTT TCAGGATTTT TCGCGGATCG AAGATGCCGA CCACCAGCGG
TATGGCGGCA CCGGACTGGG TCTGGCGATC AGTCGGCAAT TCTGCCGGTT GATGGGAGGC
GATATCACAG TAGCCAGCGC ACCCGGGAAA GGCACGACGT TTACCGTCCG GCTACCGGCA
ACGATCCGGT CGGAGGGGTC CGACAACGAT GCCGTCGTGT CCGACCCTGC ACAGGTGGAG
CGATGA
 
Protein sequence
MADVTTTGWQ PALPYTLPLV ISASLAVIVA MFVWQRRTVP GARPLIVLSI AAAVWSFAYA 
MEIAATPAPI ALFWARVQYL GIMTLPVAWI AFTLEYAGLK PWLTRRTMAE ILIVPALTQI
AVWTNDDHGL IWPQIRLNTE GPFPILDFDH GPAFWVCNIY AHICLASGTL ILLWRFVRSQ
QLYLSQIVVF LIGALAPWIG NGLYVLNLTP WTGLDLSPFG FTFTAGAIAF GAVRLRFLDV
VPIARDIVLE SMSQSVIVLD DDNRIVDINR AAQHVIGCTA AEVIGKPIRQ ALSRWPQVLD
RYYNIVELNE EVQLEIEGRP LVLDVLISPL RDRNGRLRGR LIVWHDITRL KQIEEVLRQR
NDELTALQQT LMVARDQAEA AHRAKSAFLA HMSHEVRTPL SAILGYTDLI RLDLTRRGQS
VYQEELEAIH ASAQHLLTMI NNILDLSKID AGRMPLYIEL FSIEALVHNV TQTARPLAAR
NGNSLTVIRA PDADLMTSDK TKIRQVLLNL LSNAAKFTEN GAISLRIWRE LSLSPAMKAI
DDGADWIVFE IADTGIGIAP EHLPLLFQDF SRIEDADHQR YGGTGLGLAI SRQFCRLMGG
DITVASAPGK GTTFTVRLPA TIRSEGSDND AVVSDPAQVE R