Gene Rcas_1602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1602 
Symbol 
ID5539078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2063058 
End bp2066066 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content71% 
IMG OID640893739 
Productsignal transduction protein 
Protein accessionYP_001431712 
Protein GI156741583 
COG category[T] Signal transduction mechanisms 
COG ID[COG5635] Predicted NTPase (NACHT family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTCAC CAAACGACTT GTTCACCAAC CTGGCCGCCT CGCTGGCCTA TGACCTGTTG 
AAAGCGGGCG CTGCGCGGCT CACGACCTTT GTTTCGGGTT CGCCGGAGGA GCAGACCCTG
CGCCGTTGCT ACCAGAGCGC GTTTGCGGCC ATGCTGAGCG AGGTGACCGC CAGCCTGGAC
GACGCCCATC AGGAGCTGGT CGAGTACATT CTGCGCCAAT TTGTGATCCA ACCGCAGGTG
GCGGAGGGAT TGCTGGCCCT GGCGCTGAAG GGCGCCGACC TGCCCGACCT GCCCCGCCTG
CGTCAGCACT TTGACGCGTT GGGCTTCGAC CGCGCCACCC TGCCCGTGGA TTTTGACGTC
GCCCTGGCCG CCTTTACCCG CGGCCTGACG GAGGCACTGC TGGATGAGGC CGTTCGGCCG
GGGAGTCCGC TGTACAATCG CGTGAGCCTG GGCCGTGTCC TCGCCCTGCA CGCCCTGTTG
CAGCAGCAGG GCAAGACGCT GGAGGAGATT CGCGATGCGA TCCGGCGGCT GGATGCCCAC
GGCGGCGCCA CGTACAACGT CATCATTGCC CGGGCCACCG GTGTCGCCAT TGGCGATGGG
GCGCGCGTGG AAGCCGCTCT GCCCGCCGAC GTGCGCGACC TTTTGCAGCA GATCCTGCAG
GCGCTCAAGC CGCCGCCCGA CTACACCGAC GCCGACCTGC TGAACGCCTA TCTGGATTGG
CTGATCCGCC AGCACAGCAC CCTGGAACTG CGCGGCATCC ACCGGGCCGG GCCATCGCTG
GCCGTGCCTC TGGAACGGGT CTATGCGGCG CTCCAGGCCG AGACTGTCCC CCCGACCGAG
TGGCAGGAAA GTTACCGCCT GCTGGAAGAA GACCTGCAGG CCTGGCTGGA GCAGCAGGGC
CTGGACGATT TGGCCGAAGC GGAACGCCGC CGCTACCGCT GGCGCTTCCT GGCCGGGCAC
CCCCTCATGC CCGCCCTGGA GGAGCGGGAC CGCCCGCGCC TCTTCTCCGA CCGGAAGGCC
GAAACCCTCG ACCTGGCCCA AGCCGTGCGC CGCTTCCGCT GGCTGGTCAT CCTGGGCGAC
CCGGGCAGCG GCAAGACCAC CCTGCTGCGC TGGTTGACCC TGCACCTGGC CCGCGCCCTG
CGGGAGGGCG CCGACCGCGT CCGCGTCCCC GCGATGCACG TGGACGCCGA AGCAGACGAA
GACGCGCCGC CCGTAGACCT GGGCCCGGCC CGCCTGCCCG TGTTGGTGCG CATCGGCGAC
TATGCCGAAG CCCGCCGCGC GACCCAAGCC CGCAGCGAGC CGCCCCCCTC CTTGCTGGAA
TTCCTGGGGC ATCATGGCTG GCAGAGGGAT TTCCCCACCT TTGGCCGGGA ACACCTCCGC
CGGGGAGAGC GCCTGCCCGC GGACGGGTTG CGCCGCCTCA TCCGCGACTT TTTCCGCCGC
GGCCAGGCCA TGCTGCTGTT GGACAGCCTG GACGAGATCA CTGCGGCCGA CGACCGCCGC
GAGATCATCG GGGCGGTTGA ATCTTTTCTG CAAGAATGGA TCACCGACCC GGGCGGCCGC
TCCCCCCTGG ACCCGGGCGC ACTGCCCTGG CGCGACGCAG GAGCCCTCTC TCCGGACGAA
AGCGGCGGCA ACCAGATTAT CATCACCAGC CGCATCGCCG GCTACCACGC TGCGCCCCTC
AGCCTCCACC TGACCCACGC CACCCTGCAG CCGATGAGCG ACGCGGCCGT GGACCGCTTC
TGCCAGACCT GGACGCTGGC CGTCCACCGC CTGCTGGCCG ACCCGGGCGA CAGCGAGGAA
GAGGTGGCCC GCCGCGCCGC AGACGAGGCG CAGGCCCTGC AGGCTGCCCT GCACGACCCT
ACCCGCCCCG GCCTGCGCGA ACTGGCCGGC AACCCCCTGC TGCTGACCAT CCTGGCCATG
GTGCATCACA ACAGCCAGGC CCGCCTGCCG GAACAGCGCG TCCGCCTCTA CCAGATCGCC
GTGGAGAACC TGGTGGAGGT CTGGCGGGAT ACCGGCCTCA GCGAGGATGA GGTCGTCCAG
GTGCTGGCCC CCCTGGCCGC GCACATCCAC GAGCGCTATC CCAGCGGCCT GATTGAGGAA
AACGAGTTGC GCGAGCAGGT GACCCACAGC CTGGCCGAGT ATCGCGGCGA GAACCCCGAC
CGCCCCTCGC CGGCCTTTCG GCGGGATGTG GAGGCCTTCC TGCGGGCGGT GCGGGAGCGG
GTGGGGCTGC TGTCCGCGCG GGGCGAAGCC GCCTACGGCT TCCTTCACCT GACCTTCCAG
GAGTACCTGG CGGCCCGCCA CCTGGCCGGG GACCCCAACG CCGCCCTGGA GGCCATCCTG
GCCCGCCGCG ACGACCCGCG CTGGCGCGAG CCGGTCCTCC TGGCCCTGGG CTATCTTTCC
TGGAGCCAGA ACATGGCCGC CCGCGGCCGC CTGCTGCGGG CCTTCCTGGA GGCCGACGAC
CCCCTGGGTG ACCTGCTGCC CCGCAGCGCC CTGCTGATGG CTGCGGCCAT CCCCGAGATG
ACCAGGACGC CCCCTGCCAT CGTGGAGGAG GTAGCCCGCC GCCTGCTTCA GGCCTGCGCC
GCCCGGCCAG TCGCCTGGCT GCGCCGGGAG GTGAGCGCCG CTCTTGCCCG CCTCCACCGC
AGGGAGGAGG CGTCTCTGCT GGTGGAACGC GCCCTGGTCG AGGCCCTGAC CGCGCCGCCG
CCCCCCGACG CGGCCGACCC GGGCCCGGCC GCAGCGGACC TGATCCGTGA ACACAAGTGG
TTCACCCCCG ACCTGGCCGA GGCCCTGGTG GAGGCCCTGC CCCGCGATCT GCCGGGAGGA
CTGGCCCATC CACCGCGCCC TGCTGGACCT GGCGCAACAG GCTCCCCGGG CCCTGCCCGC
CGACCGCCTG CCCTTCCGCC TCGCCCTGCT GGCCGACCCC GCCCTGGCCG CACGGGTGGC
CGGAGACCCC GACTGGCTGC GCCTGACCCT GGCCCTCTAC GGCGGACTGG ACGCGAAGGG
CTGCTTTGA
 
Protein sequence
MISPNDLFTN LAASLAYDLL KAGAARLTTF VSGSPEEQTL RRCYQSAFAA MLSEVTASLD 
DAHQELVEYI LRQFVIQPQV AEGLLALALK GADLPDLPRL RQHFDALGFD RATLPVDFDV
ALAAFTRGLT EALLDEAVRP GSPLYNRVSL GRVLALHALL QQQGKTLEEI RDAIRRLDAH
GGATYNVIIA RATGVAIGDG ARVEAALPAD VRDLLQQILQ ALKPPPDYTD ADLLNAYLDW
LIRQHSTLEL RGIHRAGPSL AVPLERVYAA LQAETVPPTE WQESYRLLEE DLQAWLEQQG
LDDLAEAERR RYRWRFLAGH PLMPALEERD RPRLFSDRKA ETLDLAQAVR RFRWLVILGD
PGSGKTTLLR WLTLHLARAL REGADRVRVP AMHVDAEADE DAPPVDLGPA RLPVLVRIGD
YAEARRATQA RSEPPPSLLE FLGHHGWQRD FPTFGREHLR RGERLPADGL RRLIRDFFRR
GQAMLLLDSL DEITAADDRR EIIGAVESFL QEWITDPGGR SPLDPGALPW RDAGALSPDE
SGGNQIIITS RIAGYHAAPL SLHLTHATLQ PMSDAAVDRF CQTWTLAVHR LLADPGDSEE
EVARRAADEA QALQAALHDP TRPGLRELAG NPLLLTILAM VHHNSQARLP EQRVRLYQIA
VENLVEVWRD TGLSEDEVVQ VLAPLAAHIH ERYPSGLIEE NELREQVTHS LAEYRGENPD
RPSPAFRRDV EAFLRAVRER VGLLSARGEA AYGFLHLTFQ EYLAARHLAG DPNAALEAIL
ARRDDPRWRE PVLLALGYLS WSQNMAARGR LLRAFLEADD PLGDLLPRSA LLMAAAIPEM
TRTPPAIVEE VARRLLQACA ARPVAWLRRE VSAALARLHR REEASLLVER ALVEALTAPP
PPDAADPGPA AADLIREHKW FTPDLAEALV EALPRDLPGG LAHPPRPAGP GATGSPGPAR
RPPALPPRPA GRPRPGRTGG RRPRLAAPDP GPLRRTGREG LL