Gene RoseRS_3980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3980 
Symbol 
ID5210963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4978646 
End bp4981852 
Gene Length3207 bp 
Protein Length1068 aa 
Translation table11 
GC content60% 
IMG OID640597571 
Productnon-specific serine/threonine protein kinase 
Protein accessionYP_001278277 
Protein GI148658072 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.863358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.858201 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAATA TTCTCTCGGC TGACATATTG GACATCGATC ACATTCTGCT ATCGACCTCA 
CAAACGGTTG TTCAACAGGG TCGGCGTTAT TACGAACGAG GGCAGGTGGA AGTCGAGCGC
GTCGATACCG ATAGCGCGCT GATCAATGTC TTCGAATCGC CGGGAGTGGC GCATCAGGTC
GTTCTTCGTC TGCACAACCG CCAGATCTTC GTCAGTTGCA CCTGCCCGCA GCGGCATTAC
TGGACGTTCT GCCGTCATCG CGTCGCTGCG ATTCTGGCGC TCCGTGATTA CCTGATAGCC
CATCCCCCCA GCATCTGGCG CGCCGTGATC GGCGAGGCGG TCAGAGCGCA ACCCCAACGC
AAGCCGACTG CGCAGCCAGC ATGCATCGTC TTCAGTCTGC AATACGCTTC GACCGCCTGG
ACAATCGTTC CATATGCGAT CCCGCTGCGG CGCCTGCCAC CGGAAGCCCT GGGTGATCAG
GAAGAACTCG CCGCTGCCAT TGCTGCGCTG AACCTGTCGC ACGAGGCGCG CCTGATCCGC
TCACGCATCT CCCGGCAGAC GTATCCCCAC GCATCCTTCG AGACGATTAT CGCCGTCAAT
ATGGCGCTCG CCACTGCTGA AGGGTTCCTG TACCACTATA CCAGCGATGA TGAGTCAGGT
GCAGTGTACG ATCCGCTGTT ACTGCTCCTT GCGGGGCAAC TGGCGTACCT TGGCGATGAG
GGCGATCCGT TGCAACATCG CATCCGCATC ATGACCGATC CGGCGTCGGT CGAACTCCGG
ATCGAGCAGA ACGACCGGGA TGCGATCCGT ATGCGACTGC ACCTGAACGT CGCCGGTGAA
TCGATCCCGC TCGATCCGCG CACAACCTAT GTATTCAGCA ACAGTCCGCT CTGGCTGATA
GTGGATGACC GATTGATCCC GGTGAGCGAC CCTCACGGCG TCGCCGCTCT GCTTATCAAT
CACCCCGACC TGACGATCCC TGCGCATGAG CAGGACGAGT TCTTCGATCA GTATCTGCTG
CCGCTGGCTG AGCGGATGAC GCTCTGTGGC GATCTGTTGC GCTGGGAAGA GATCGATGTC
TCGCCAACCC CGCGCCTGTA TCTGAGCGAA ACGAACGGCG CGCTTCTGGC AGACCTGCGC
TTTGCCTACG GCGAGTTCGA AGTACCGGCT GAAGTGAACC CGCCGCCGTA TGTCGTTCGC
CGGCAGGCAA ACACGTTCAC CCTGATACGC ATCCGGCGCC GCCCTGAAGA AGAAGCCGCA
TTGCTGCGCG ACATCGGCAG TCCGACATAT GGTTTGAAGA AAGGCGCAGA GATCGGTCGG
TTCGAACTGC GCAAGGCGGT GCATCCGCTC GATTTTCTGA TCCATCGCAT TCCGATGCTC
ACCCGGCAGG GTGTAGAGGT CTTTGGCGAG GATCAGATCA AAGCGGCGCG GGTCAACCGC
AACCAACCGC AGATCTCGTT CCGTGTTAGT TCCGGCATCG ACTGGTTCGA CCTGGAAGCG
GTGGTTCGCT TTGGCGATCT CGAAGCGTCG CTCAAAGAGG TGCGCCGCGC CATTCGCCGC
CGCCAGAACT ATGTGCGCCT GGCGGATGGC GCGATCGGCG TCATTCCCGA TGAGTGGATC
GAACGCTACC GTCACCTGTT TGAACTGGGG GAAGATACCG AAACGGGTGT CCGCCTGACC
AGAGGGCACC TGACATTGCT CGACCAACTC CTCGCCGAAT CCGACCATGT CCAGACCGAT
GAGGAGTTTG AGCGCCAGCG GGAGCGCCTG CGATCCTTTG AGCGCATCCA GGATCAACCG
CTGCCGCGCG GATTCCGCGG CGAACTGCGT CATTACCAGC GCGCAGCGTA CAACTGGCTC
CACTTCCTCA ACGAATACGG CTTTGGCGGA TGCCTGGCGG ACGACATGGG CACCGGCAAG
ACGGTCTGCA CCCTGGCGTT CCTGCAATCG CTCGAAGAGC GCGATCCCGA CGGTCCTGCC
AGTCTGATCG TTATGCCGCG CTCGCTGCTG TTCAACTGGG AACGTGAAGC TGCGACCTTC
ACGCCCGATC TGCGCGTTGC CGTCCATCAC AGCGCCGTGC GGTCCCTCAA CCCGGCGCAG
TTCGACAGGT ACGACCTGGT GTTGACGACC TACGGCACGA TGCTGCGCGA CATCGAACTG
CTGCGTCGTT ATCGCTTTCG CTACGTCATC CTCGACGAAT CGCAGGCGAT CAAGAATCCG
CTGGCGGAAA CTGCAAAAGC GGCGCGCCTG CTGAACGCCC AACGCCGCCT GGCGCTCACC
GGCACCCCGG TCGAAAACTC GACGCTCGAA CTCTGGAGCC AGTTCGCCTT TCTCAATCCC
GGACTGCTCG GCAACCTGGA TTATTTTCGT GAAACGTTCG TGACTCCCAT CGAAAAGAAG
CAGAGCGCCG ATGCCGCCCA GTTCCTGCGC AAACTGGTCT ACCCGTTTAT TCTGCGCCGC
ACGAAAGACC AGGTGGCGCC GGAATTGCCG CCGCGCAGCG AACGGGTGAT CGAAGTCGAG
ATGGAACCGG CGCAGCGCCG CCTGTACATC AAACAGCGCG ACTACTATCG CGCACTGCTG
CTGGGGCTGA TCGATAACGC CGGGATCGAC AACGCTCGCA TGCAGGTGCT CGAAGGATTG
CTGCGCCTGC GCCAGATCTG CAACCATCCC CGCCTCATCG AGCCAGATTT TCGCGGATCG
TCGGGCAAGT TCGAACTGCT CATCGAAACG CTCGAAACGC TGGCTGCCGA AGGGCGCAAG
GCGCTGGTCT TCTCCCAGTT CGTTCAGATG CTCACCCTCA TCCGCGAGGC GCTCGACGCA
CGCCGCATTC CGTATGCCTA TCTCGACGGG CAGACCCGCC AGCGTCAGCA GGAGGTCGAC
CGCTTCCAGA GCGATGAAAC CCTCCCTTTC TTTCTGATCA GCCTGAAAGC TGGCGGCGTC
GGGCTGAATC TGACCGCTGC CGATTATGTC ATCCACGTCG ATCCCTGGTG GAACCCGGCA
GTCGAGATGC AGGCAACCGA CCGCACCCAC CGCATCGGTC AGGAAAAACC GGTCTTTGTC
TATAAACTCG TGACCCGCGA CTCGGTTGAA GAGAAGATTC TGCATCTCCA GAATCGCAAA
CGTGAACTTG TTGAACAACT CATCACTGCG GACGCCAGCA TGCTCAAAGC GCTCACACGC
GAAGATGTCG AGGCGCTGTT TGGGTGA
 
Protein sequence
MWNILSADIL DIDHILLSTS QTVVQQGRRY YERGQVEVER VDTDSALINV FESPGVAHQV 
VLRLHNRQIF VSCTCPQRHY WTFCRHRVAA ILALRDYLIA HPPSIWRAVI GEAVRAQPQR
KPTAQPACIV FSLQYASTAW TIVPYAIPLR RLPPEALGDQ EELAAAIAAL NLSHEARLIR
SRISRQTYPH ASFETIIAVN MALATAEGFL YHYTSDDESG AVYDPLLLLL AGQLAYLGDE
GDPLQHRIRI MTDPASVELR IEQNDRDAIR MRLHLNVAGE SIPLDPRTTY VFSNSPLWLI
VDDRLIPVSD PHGVAALLIN HPDLTIPAHE QDEFFDQYLL PLAERMTLCG DLLRWEEIDV
SPTPRLYLSE TNGALLADLR FAYGEFEVPA EVNPPPYVVR RQANTFTLIR IRRRPEEEAA
LLRDIGSPTY GLKKGAEIGR FELRKAVHPL DFLIHRIPML TRQGVEVFGE DQIKAARVNR
NQPQISFRVS SGIDWFDLEA VVRFGDLEAS LKEVRRAIRR RQNYVRLADG AIGVIPDEWI
ERYRHLFELG EDTETGVRLT RGHLTLLDQL LAESDHVQTD EEFERQRERL RSFERIQDQP
LPRGFRGELR HYQRAAYNWL HFLNEYGFGG CLADDMGTGK TVCTLAFLQS LEERDPDGPA
SLIVMPRSLL FNWEREAATF TPDLRVAVHH SAVRSLNPAQ FDRYDLVLTT YGTMLRDIEL
LRRYRFRYVI LDESQAIKNP LAETAKAARL LNAQRRLALT GTPVENSTLE LWSQFAFLNP
GLLGNLDYFR ETFVTPIEKK QSADAAQFLR KLVYPFILRR TKDQVAPELP PRSERVIEVE
MEPAQRRLYI KQRDYYRALL LGLIDNAGID NARMQVLEGL LRLRQICNHP RLIEPDFRGS
SGKFELLIET LETLAAEGRK ALVFSQFVQM LTLIREALDA RRIPYAYLDG QTRQRQQEVD
RFQSDETLPF FLISLKAGGV GLNLTAADYV IHVDPWWNPA VEMQATDRTH RIGQEKPVFV
YKLVTRDSVE EKILHLQNRK RELVEQLITA DASMLKALTR EDVEALFG