Gene Rcas_3489 details

Gene Information

Locus tag: Rcas_3489
Symbol:
ID: 5540988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4552145
End bp: 4555264
Gene Length: 3120 bp
Protein Length: 1039 aa
Translation table: 11
GC content: 62%
IMG OID: 640895607
Product: WD-40 repeat-containing serine/threonin protein kinase
Protein accession: YP_001433557
Protein GI: 156743428
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
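
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, since the record does not state its coordinate convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4552145
end_bp = 4555264
gene_length_bp = 3120
protein_length_aa = 1039

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is assumed to be the
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)

print(span == gene_length_bp)           # True
print(remainder == 0)                   # True
print(codons - 1 == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 3120 bp is 1040 codons, of which 1039 encode amino acids.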


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.261998
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCAG GCGAGCAAGT CATCCGTGAT CGCTACCGCG TGATCTACAC CGTCGATGAG 
CGTCCCGGCG TGAAAATCTA TCGTTGCCGC GATGATCAGA GCGGTGAGTT GTTGCTGGTA
GCCGGGTTTG CGCCGCCCGA AGGGGCGCTC AGCGATCTGG ACATTCTTGC AAAGCAGATC
GCGGCGGTGC GCCACGAGGC ACTGTTGCCG CTACGCGACC ATTTTGCCGA GGGTGCGCTC
TACTACATGG TTTGCGCCGA TCCCGGCGGG CAGGATGTTG AACGAGCGAT ACGCGCGCGC
GGCGGTCCGC TGCCGGAAAC CGACGTGCTG ACGCAGGCGA CCCGATTGCT CTTGCTGCTG
GAACATCTGC ACAACCAGCG TCCGCCGTTG TTTCTGGGCG ATCTGGCGGT TGTCGATGTT
TGGGTCAACG ACAAAGGCTC CTTTCTGATT ACTCCCTTTA CATTAGCGAC GCCAATCGGT
CAGTCGCCGT CGCCGTACCG CGCGCCTGAA CTCTCGCGCC CCGATGCAGA GCCTACAACG
GTGAGCGATG TGTATACCAT CGGCGCGCTG ATGTACCACG CACTGACCGG CTGGGCGCCG
CCGACGCCCG CGCAACAGGA AGCCGGGATG CCGCTGGCGG GTCCGCGATC GCTCAATCCG
CACATCTCGC CGCTGGTCGA GCAGGTGTTG CTGCGCGCTC TGCAATTGCG CCCGGTCAAT
CGGTTTCAGC AGGCGCGCGA AATGCGCATT GCGCTCGAAA CCGCGCAGAT GATGGGCGGA
CGTTCGCTTG GTCTTGGTCC TGATGTGCTG ACGCACGCAA CGCCATCCCC TCCTTATGAA
GAACAGCCGC AGCAAACGGC AGAGAATGTC GTGCATGCGG CACCGGGCAT TGCGCCGCCC
GCGGCGCCAG CGCCTGTTCC CGTGGCTTCG GTTGCTCCAC CGCCTGCAGC GCCGGCGCCC
TATCCGCCGC CAGGGTATCC AACCGGCTAT GCGCCTGCTC CCCCACGGCA GGGTTTGAGC
ACCGGCTGTC TCGTCACTTC TGCGGTGCTG CTGACCGTTG CTGCGATTGG GGTCTGTCTG
GCGATTGCGG TCTTTCTGCC GGGCAGTCCG CTGCGCCAGA TGCTCGGCAT GAGCGGCGCC
GCCGCTGCGC CAACATTGGC GCCAACGGAA GCGTCACCCG CAGCAACGGT GGCGCCGACT
GCAACGAGCG CTGCGAACGT GGAACCGACC GTCCCGCCCC CTACGCCAAT ACCTGCCAAT
GGATCATCTG CGCCTGGACC GGATGCTATC TCGCCACAGA ATGTCACGGC CATCACAGCA
ACACGCCAGC TGTCCATGTC GGTATTTGGT CCTGTCGCTT ACTCGCCTGA TGGACGGCTG
CTCGCCGTGG GGATAAGCGA AGCGGTCAGC CTGCATGATG CTACAACGCT CGATGACCTT
GGAACCTGGT TTGACCACAC GGGCAAAATC ACATCACTCG CCTGGTCTGC CGACAGCACC
TTGCTGGCGT CGGGCGCCAG CGACGATAAC GAAATTCGCA TATGGGACGT ATCGACCGGA
CGGGTGGTTC GGCGTCTGAG CGGTCATACC GGCTGGATTC GCAGCATCGC CTTCGCTCCC
AATGGCACGC TGCTAGCATC GGGGAGCACC GATCAAACAG TGCGTATATG GGATGCCGCA
ACCGGTCAAC TGCTGGCGAC CCTGAGCGGA CACACCGGCT TCATTGGCGG TGTGGTCTTT
TCCCCTGACA GCACGACGCT GGCATCCGCC TCGCGTGATG GCAGCGTGCG CCTCTGGGAC
GTGGCATCCG GGCGTGAAAT CAGTGGCTTC AATTTTCGCA CTCCGCTCGA CCCGGACACC
AACCTGCGCT ACTGGGCGAC CGGTGTCGCG TTTTCGCCTG ACGGCAAGGC GCTGGCAGTC
GGATCGACCG AGGGGGTCGT CTATCTGCTC GATGCCGCCA CCGGTCAGGT CATTCATCAG
TTGCGCGGTC ATACGAACTG GATCGTCATT CGCGGACTGG CGTTCGCTCC TGATGGAAAG
ACCCTCTACT CAGCCGGGCT GGACGCAACT GTGCGCATAT GGGATGTGGA ACGCGGCGTG
CAGACCGGTG TACTCGATGT TCACCGTCTC GACATTTTCA GCATTGCTAT CAGTCCGAAT
GGTGAGCGCC TGGCGTCGGT CAGCGATCAG GAAGGGCGCA TGATCGTCTG GGACCTGACA
CAGCAGCGCC CCGACATGAA CCTGCGGATC GGGCTGGGAT TGGTAACCTC GCTCGTCTTC
TCACCCGATA GTGAGGTACT TGGCGCCGTC GGGTACAATG GCATCATTCA GTTGCGGTTG
CTGGCGAATG ATCAAATCCG TCAGTTTGCA GGCTCTTCCA CGTCGGTGCA ATCACTGGCA
TTCCTGCCCA ATGGTCGTCT GGCAACGATT ACCGAGCAGG ATACGGTGGT TATCCTGGAT
TTTCTGCGCG AAACATCCAG CGATCTGACC GGATCGACCG GCGATCCGCT CTGCATTGCT
GCCGATCCGG GTGGAAAAGT GGTTGCCGCC GGCGCAAACG ATGGCACGGT GGCGATCTGG
AATGGCGCCG ATGGTCAGTT CCTCCGGTCG CTCAAGACCG ATCTCCCGGC GGTGTTTCTG
GTGGCGGTCA GCGATGATGG GGCATTCGTG GCTGCTGCCG GCACGCCGAA TGATCCACGC
ATCGAAATCT GGCGTGTCGC CGATGGGCAG CGCGTCCAGA CGTTGAGCGG CATGCAAAAC
TCCATCACCA GCATTGCGTT CCAACCGAGA GGGACGTTGT TTGCAGCAAC GGGGACCGAT
GGTGTGCTGC GTATGTGGAA TTATCGCACG GGCGCTTCGG AGCGAAATAT TAAGGCTGCG
CCGGAAAATG GTTGGTTTAC TGCACTGGCA TTCTCTCCCG ATGGCGCAAT CCTTGCGACC
GGCACACCCA CCGGCGTGGT GCAGTTCTGG AATCCGGCGA ACGGCGCAGA GATGGCGCAG
GTTCAGCAGC AATTCGGCGT GCTGGCGCTG ACGTTCAGCC CTGATGGCGC ACAACTCGCC
GCCGCCGGTC GTGATGCGGG CGTGACGCTT TATCGGGCTG TGCGCAGTGG TTCATCGTAG
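
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before the sequence is measured. The following is a minimal sketch of how the length and GC fraction could be computed from such a block-formatted sequence. The helper names `clean` and `gc_fraction` are hypothetical, and only the first printed line of the gene is used as input here; the full sequence would need to be pasted in to compare against the reported 62% GC content.

```python
def clean(seq: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First printed line of the gene sequence (60 bases in 6 blocks of 10).
first_line = (
    "ATGAGTTCAG GCGAGCAAGT CATCCGTGAT "
    "CGCTACCGCG TGATCTACAC CGTCGATGAG"
)

print(len(clean(first_line)))   # 60
print(gc_fraction(first_line))
```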
 
Protein sequence
MSSGEQVIRD RYRVIYTVDE RPGVKIYRCR DDQSGELLLV AGFAPPEGAL SDLDILAKQI 
AAVRHEALLP LRDHFAEGAL YYMVCADPGG QDVERAIRAR GGPLPETDVL TQATRLLLLL
EHLHNQRPPL FLGDLAVVDV WVNDKGSFLI TPFTLATPIG QSPSPYRAPE LSRPDAEPTT
VSDVYTIGAL MYHALTGWAP PTPAQQEAGM PLAGPRSLNP HISPLVEQVL LRALQLRPVN
RFQQAREMRI ALETAQMMGG RSLGLGPDVL THATPSPPYE EQPQQTAENV VHAAPGIAPP
AAPAPVPVAS VAPPPAAPAP YPPPGYPTGY APAPPRQGLS TGCLVTSAVL LTVAAIGVCL
AIAVFLPGSP LRQMLGMSGA AAAPTLAPTE ASPAATVAPT ATSAANVEPT VPPPTPIPAN
GSSAPGPDAI SPQNVTAITA TRQLSMSVFG PVAYSPDGRL LAVGISEAVS LHDATTLDDL
GTWFDHTGKI TSLAWSADST LLASGASDDN EIRIWDVSTG RVVRRLSGHT GWIRSIAFAP
NGTLLASGST DQTVRIWDAA TGQLLATLSG HTGFIGGVVF SPDSTTLASA SRDGSVRLWD
VASGREISGF NFRTPLDPDT NLRYWATGVA FSPDGKALAV GSTEGVVYLL DAATGQVIHQ
LRGHTNWIVI RGLAFAPDGK TLYSAGLDAT VRIWDVERGV QTGVLDVHRL DIFSIAISPN
GERLASVSDQ EGRMIVWDLT QQRPDMNLRI GLGLVTSLVF SPDSEVLGAV GYNGIIQLRL
LANDQIRQFA GSSTSVQSLA FLPNGRLATI TEQDTVVILD FLRETSSDLT GSTGDPLCIA
ADPGGKVVAA GANDGTVAIW NGADGQFLRS LKTDLPAVFL VAVSDDGAFV AAAGTPNDPR
IEIWRVADGQ RVQTLSGMQN SITSIAFQPR GTLFAATGTD GVLRMWNYRT GASERNIKAA
PENGWFTALA FSPDGAILAT GTPTGVVQFW NPANGAEMAQ VQQQFGVLAL TFSPDGAQLA
AAGRDAGVTL YRAVRSGSS
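
The relationship between the two sequences can be illustrated with a small translation sketch. Translation table 11 (bacterial, archaeal and plant plastid) assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, no special handling of the first codon is needed here. The `translate` function and the table construction below are an illustrative sketch, not the tool that produced this record.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third base varies fastest)
# paired with the amino acids of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First printed line of the gene sequence.
first_line = (
    "ATGAGTTCAG GCGAGCAAGT CATCCGTGAT "
    "CGCTACCGCG TGATCTACAC CGTCGATGAG"
)
print(translate(first_line))  # MSSGEQVIRDRYRVIYTVDE
```

The output matches the first 20 residues of the protein sequence listed above.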