Gene Rcas_1113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1113 
Symbol 
ID5538579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1441202 
End bp1444330 
Gene Length3129 bp 
Protein Length1042 aa 
Translation table11 
GC content60% 
IMG OID640893247 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001431230 
Protein GI156741101 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCA CGATTGAAAC CATGAATACG ACACACGCCG CCTCATCGTC AGCACAGGTG 
ATCCGGCGCA TCATTGCCGG TTTTGCGCTT GCGCTCGGCT CGATCACCGT CATTGCGTTG
ATCTCCTATG GATGGACGCA ATACATCCTG GCAAACGAGC GTGATCGCTT CAATCTGCTG
GCAGCCAACA GTCTGCAGCT CATCCGCCTT CAGCAAATTG CGCTCGCAGC GGAACAGGTG
ATGGACACGT CCGATCCCGA AATAGCGGCG CAGGCGCGAG AAACGTTGAG CGCCGCCATC
GATGAGATGG CGCAGTCGCA GCAGGAACTA TCGCGCATGG TGGCAACGTT GCCGCCGGAC
GATCCACTCT GGCGCCTGTA CGCTGATCCG GCTACGGCGT TCGATGCGCG GGTGCAGGAG
TTTCTCGCAC GCGCCCGGCG CTTGCGCGAT ACTGCCCCTG ATCCGCAGCA CCCTGATCTC
CAGGTGATCC GTCGGATGGC GCTGTATGAC CTGCCGCCCC TCTTTCGCAA CGCCTCCCGT
CTATATGTCG ATGCGCGTCG CAGCCTCCTG CAAACGCTCG ACATCATGCA TGCGACCTTT
CTTGGCGTGA CACTCCTGGC ATTAGCGTTC GAGGGCATCT TCATCTTTCG CCCAATGGTG
CGAGAGGCGC GCGGCTATAT AACACAACGC GACGTAAGCG AATCTCGTTT GCGTGCGCGC
GAACGGGTGA CGCGCGCGCT CTATGACATC ACCTCAACGT CGCAACTGGA TCACTTGCAG
AAAGTGCAGG CGCTTCTTGA GATGGGATGC GACTACTTCC ACATGCCGGT CGGCGTATTG
ACTCGCATCG ATGGCGAGGA ACTCGAAATA GTCGCCGTGC GCGATCCTGG TCATCGGATG
AAGCCTGGCG AACGTTTCCC GCTTGCTGAT CGCTATTGCG CCGCCGTTCT CGACGCAGGC
GCACCGGTGA TCATTGATCA TGCATCGCAA TCCGGATGGA ATGACCATCG CTGCTATCGG
TTGACCCGCA TGGAAGCATA CATCGGCGCG CCGGTCCAGG TGCGCGGCAC GACAGCGGGA
ACGCTCTGTT TTGCGAGCGC CATGCCCCGC ACAACGCCGT TCACCGATGG CGATTATGAC
CTGATCCGAC TGATGGCGCA ATGGATCGGC AGTGAGCAGG ATCGCCTGCA AACTGAAGCA
GCGTTGCGCG AGAGTGAAGA ACGCTTCGCG CTGCTCGCCA GCGTTACGAC CGAAGGAGTG
ATTATCAGTG AGCAGGGGAT GATCGTCGAC GCCAATGCCG CTGCTGGAAC TCTGTTTGGC
GTGCCGCCGG AGCGTTTGCG CGGCATGCCG GTCTTTGAGT TTACGACGCC AGAAGGGCGC
GAGAAAGTCG CGCGCGCGCT CACAACCGGC TATGATCGTC CTTATGAGGT GCTCGCCCGG
CGTATCGATG GGACCCTCTT TCCCGCTGAG GTCACCGGTC GCAACATTCC CTATCACGGT
CGCACGGCGC GCGTGACGAC GATCCGCGAT ATTTCGCGGC AGCGACTGGC GGAAGCCGCA
CTCCGCGCCA GTGAAGAGCG GTTTCGCCAG TTGGCGGAGA ACGTCAATCA GGTCTTCTGG
ATGTCTACTC CATCGCTCGA CCAGATCCTG TACGTCAATC CGGCCTACGA ACGCATTTGG
GGGCGATCCT GCGATAGCCT GTATGCTCAA CCGTCCTCGC TCTTCGAGGC GATTGTTCCT
GAAGACCGCG AGCGAGCGCT TGCGCTTCAT CGGGCAGAGT ATGCGCGCGG ATACAGTATC
GAGTTTCAAA TCCTGCACAC CGACGGTCAG CAGCGCTGGA TCCTGACCCG CGCATTCCCG
GTGATCAACG AGGCGGGCGC CGTGTATCGC ATCGCCGCGA TTTCTGAAGA TGTCACCGAG
CGCAAACAGG CAGAGGCGGA ATTGCGCGCC GCAATGACGG CGCTGGAAGT GCAGTATCAG
ACCGCAGATC GCGCCCAGAG CGAACTTCGC GCCATCCTCG ACGCTTCCAG CGAGGTGATC
GCCCTTCTCG CGCCAGATGG CGCGTTCCTG ACCGTCAACC GTCGTTTCTG TGATATATTT
GGCGTGCCTG CCGATCAGGT GCTGGGACGA CGCCTGATTG ATATGCGCGC CGAGATCCGG
TGGTTCTTCG GTGATGCCGA TGAGGTGTAT GAGCGGATGC TCAGCGCGTT ACAGGATGCT
CAAGACGTCT TCCGTGAGCA GGTAGTGCAA CTCAGACCGC AGCATCGTGA ACTCGCCATC
TTCTCCGCGC CGGTCTGGAC TGCCAATCAG GTCCACCTTG GGCGACTCTA CGTCTTCCGC
GATGTCACCC ATGAACGCGC CGTTGAACGC ATGAAGTCCG AGTTTGTGGC GATGGTGTCG
CATGAACTGC GCACGCCGCT GACTTCTATC AAAGGGTATA TCGATATGCT GCTCGATGGC
GATGCCGGAC CGCTTGCCGT CGAGCATCAG GAACTTTTAC AGATCGTCAA ATCGAATGCC
GACCGGTTGC TGCTCCTGAT CAACGATCTG CTCGATATGT CGCGCATTGA GGCGGGGAAA
CTGATGCTCC ACCGCGCGCC GCTCGATGTT CGCCCGTTGA TCCGTCAGGT TGCCACAGCC
CTGCGCCCGC AACTGGACGC AAAACACCAG CGTCTGAATC TGGACCTTTC TGAAACCCCT
CCCGATGACG CGCCGCCGCT GATGTTCGGT GATGCGGCGC GTGTCCATCA GATTTTAACC
AACGTGCTCT CGAATGCGAT TAAGTATACA CCTCAGGGTG GCGAGATTTC AGTTCGTCTC
TCCGTGGAAC CGCCGTGGAT GTGCATCGCC GTGCAGGATA CCGGCATCGG CTTGACCCCT
GAGGAACAGG AGCGCATCTT CGATCGTTTC TACCGCGTGC GCAACCGTGC AACGCGCGAA
GCCAGCGGAA CCGGGCTTGG TCTGGCTATT ACCCGGTCGC TGGTTGATCT GCACCAGGGG
CGCATTACCG TTGAGAGCGA AGCAGGCAGA GGTTCGACGT TCCGGATCTG CTTTCCGCTG
CTGACCTCGC TCGATGGCAT CAATGACGAT GAGGCATTGG CGCAAGTTTC GACGGGGGAA
CTGACGTGA
 
Protein sequence
MPRTIETMNT THAASSSAQV IRRIIAGFAL ALGSITVIAL ISYGWTQYIL ANERDRFNLL 
AANSLQLIRL QQIALAAEQV MDTSDPEIAA QARETLSAAI DEMAQSQQEL SRMVATLPPD
DPLWRLYADP ATAFDARVQE FLARARRLRD TAPDPQHPDL QVIRRMALYD LPPLFRNASR
LYVDARRSLL QTLDIMHATF LGVTLLALAF EGIFIFRPMV REARGYITQR DVSESRLRAR
ERVTRALYDI TSTSQLDHLQ KVQALLEMGC DYFHMPVGVL TRIDGEELEI VAVRDPGHRM
KPGERFPLAD RYCAAVLDAG APVIIDHASQ SGWNDHRCYR LTRMEAYIGA PVQVRGTTAG
TLCFASAMPR TTPFTDGDYD LIRLMAQWIG SEQDRLQTEA ALRESEERFA LLASVTTEGV
IISEQGMIVD ANAAAGTLFG VPPERLRGMP VFEFTTPEGR EKVARALTTG YDRPYEVLAR
RIDGTLFPAE VTGRNIPYHG RTARVTTIRD ISRQRLAEAA LRASEERFRQ LAENVNQVFW
MSTPSLDQIL YVNPAYERIW GRSCDSLYAQ PSSLFEAIVP EDRERALALH RAEYARGYSI
EFQILHTDGQ QRWILTRAFP VINEAGAVYR IAAISEDVTE RKQAEAELRA AMTALEVQYQ
TADRAQSELR AILDASSEVI ALLAPDGAFL TVNRRFCDIF GVPADQVLGR RLIDMRAEIR
WFFGDADEVY ERMLSALQDA QDVFREQVVQ LRPQHRELAI FSAPVWTANQ VHLGRLYVFR
DVTHERAVER MKSEFVAMVS HELRTPLTSI KGYIDMLLDG DAGPLAVEHQ ELLQIVKSNA
DRLLLLINDL LDMSRIEAGK LMLHRAPLDV RPLIRQVATA LRPQLDAKHQ RLNLDLSETP
PDDAPPLMFG DAARVHQILT NVLSNAIKYT PQGGEISVRL SVEPPWMCIA VQDTGIGLTP
EEQERIFDRF YRVRNRATRE ASGTGLGLAI TRSLVDLHQG RITVESEAGR GSTFRICFPL
LTSLDGINDD EALAQVSTGE LT