Gene RoseRS_0666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0666 
Symbol 
ID5207604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp832832 
End bp835501 
Gene Length2670 bp 
Protein Length889 aa 
Translation table11 
GC content62% 
IMG OID640594282 
Productprotein kinase 
Protein accessionYP_001275035 
Protein GI148654830 
COG category[K] Transcription
[L] Replication, recombination and repair
[N] Cell motility
[R] General function prediction only
[T] Signal transduction mechanisms
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0515] Serine/threonine protein kinase
[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.415154 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAGT TGAATCTGAT CGGTCGAACG ATAGGACGCT TTGAGATCCT CAGCGAACTG 
GGACGCGGCG GCATGGCTGT GGTCTACAAG GCGCGCCAGA CGTCGCCCAA TCGAATCGTT
GCGCTCAAGG TGCTGCCGCC GGAGTTGAGT CTCGACCGTA CCTACATTGC TCGCTTTCGC
CAGGAAGCTG ACAGTGCAGC CGCTCTCGAG CATCCCAACA TCGTTCCAAT CTATGTGGTG
GACGAGGCTG AGGGTCTGCA CTATATTGCG ATGAAATTCA TCGATGGACG CACGCTGAAG
GAGATCATCC ACGAACGCGG CTCGCTGCCT CTCGATGAGA CGATCAGGCT GGTCGAACAG
GTTGCCAGCG CTCTCGACTA TGCCCACAGC CGGGGCGTCA TCCACCGCGA CATCAAACCT
TCCAATATGA TGCTTGATCG CAACGGCTGG GTCTACCTGA CCGACTTTGG ACTGGCACGC
GGCACGGGTG GAGGCGGCGG GTTGACCATC GCCGGAACGG TGATGGGAAC GCCGGAATAT
ATGTCGCCTG AACAGGCGCA GGGGTTGCCG AACGTTGGAC CGCCGACCGA CATCTACGCG
CTCGGCGTAG TGGTCTACGA GATGCTCACC GGTCGCATGC CGTTCAAAGC CGATACGCCG
ATGGCGATGC TGGTTGCGCG TCTCCAGCAC GCGCCCATTC CGCCGCGCGA TTTCCGTAGC
GATCTGCCGC TGCCGGTCGA GGATGTTATC ATGCGCGCGC TGGCGCGCAA ACCGGAGGCG
CGCTATCAGA GCGCCGGTGA ACTGGTTGCT GCGCTGAAGC AGGCTGCCGG TTTCGGCACA
GGTCCGATGC ATAGCGCCTC GCCGCCCGTT TCGCCGCCAG CCGGAACGCC GCTGCCCTAC
GTGCGCACCG TACCGCCGTC GCCTCCTGCC GGAACGACAG CACCCCGCCC GCCCGAATCT
CCTGTTTCCC CGGTCTACGG GCTGCCGACA CAACAGGCTT CACCGCCGTC CGGCGCTCCG
ACGATGCATG CCACGTCGCC TTCGTATGGA ACTCCGGCAC AACCCCCTGC TGGCGCGCCG
ACAATGCAGG CGATGCCTCC TGGAACCCTG CCGCCGCAGC CTGCCGCGCC GTCTGTGCCG
CAAGCGCCGG CGAAGCCGAA GAAAAGCGGC GGTATGGGGT TGATCGTCGG CGGGATCGCG
GCAGTCGTGC TGCTGGCGCT GGTTGCCCTG TTCGCGCTTC GCCCTGCGAC GGATGGACAG
AACCGACAGG TCGACGCTGC CCTCGCTCAG GCTCACGAAC TGTTCAACCA GCGCGGGAAA
CTGGATCAGG CTATCGAAGC CTATCAGGAA GTGCTCAGGA TCGATAGCGC CAATGCCGAA
GCGCGCACGC GCCTGGCGTT GATCTATCAG ATGCGCGCGC GCTATAGCGA TGCCGAAGCG
GAAGCGCGCG CCGCAATCGA TGCCGATAAC CGTGCGATTC TGGCGCACGC CGTACTCGCT
GAGTCGCTGC ATAGCCAGGG CAGGTACAAC GAGGCGCTCG ACGCGGCTGA TCGCGCGGTT
GCCGCTGATC CCGACCATCC GACAGGGTAT GGATCACGCG CCATTATCAA GGCAGCCCGC
GCCCTCGACG ACGCCGATGC GACCATGCTC GCCGAAGCGA TCGATGATGC CGAAATCGCG
CTCGAAAAAG CAGCCGGACG GGACAATCTG ATCCAGGCGC TGGCGCACAA TGCACGCGGC
GTTGTCTACT GGTATCAGTA TCTGTTCAGT AACGATGCAG CGATGGTTGC ACGCGGCGGC
GATGAATTTA ATCGCGCTAT CGGTCTCCAG GGGCAGATCG CTGTCTTCCA CTCTAACCTC
GGCTACTTCT ACAACGACCA GGGCGCGAGT GCGCTGCAGC GCGGCAACCG TCAGGAAGCC
GTGTCGCTGC TCGATCTGGC GCGGCAACAG TTCGAGCGTG CCCAGGATAT CGATCCGGTC
AATGGGCATG CGCACGCCGG GCTTGGCTGG AACCTCTATT TTTTGGAGGA TTATACCGGC
GCTGTGGCAG AGTTCGATAA AGCGATCGAA CTGAACCCGC AGGATACCGA TGCACATATT
GGCAAGAGTT ACGCGCTGCT GGGACTCTCG CCGCCCGACT TCGACGGCGC TATCGCCACC
CTGGAGCAGG CGACCACTGT GGCGCCGTAT GTCCCTGAAC TCTTCGCGCG CCTTGGATGG
ACGCATCTGA GCAAGGCGTT CGCAGCAGAA AGCGGCAGCG CAGCGCAGAC GGCGCTCTTT
CAGCGCGCAG AGGATCGTTT CCGCGAAGCG CTGGACCGCA ATGATCGTTT CGTCAACGCG
ATTACCGGTC TTGGATGGGC GCAGTCGGCG TTGGGGCAGT ACGATCAGGC GCTCGACACG
CTTCAGCGGT CGCTGGCGAT TAAGGAAGAC CAGGGAGATG CGCATTTTGG CATTGGCTGG
ACGTACTACA ACATGGGGCG CTTCACCGAT GCCGAGAGCA GTTTTCGTCG CGCCATCGAG
ATTCAGCCAC TCGATGGCAG CAATTACTAC TGGCTCGGAT TGACGCTCGA ACAGTTGGGG
CGTGTGGAGG AGGCGAAGCA GGCATATCGC ACCGCTGTCG AAAAAGGAAA CAGTTTCGCA
CAACAGGAAC TGGAACGCCT GGGGCAGTGA
 
Protein sequence
MPELNLIGRT IGRFEILSEL GRGGMAVVYK ARQTSPNRIV ALKVLPPELS LDRTYIARFR 
QEADSAAALE HPNIVPIYVV DEAEGLHYIA MKFIDGRTLK EIIHERGSLP LDETIRLVEQ
VASALDYAHS RGVIHRDIKP SNMMLDRNGW VYLTDFGLAR GTGGGGGLTI AGTVMGTPEY
MSPEQAQGLP NVGPPTDIYA LGVVVYEMLT GRMPFKADTP MAMLVARLQH APIPPRDFRS
DLPLPVEDVI MRALARKPEA RYQSAGELVA ALKQAAGFGT GPMHSASPPV SPPAGTPLPY
VRTVPPSPPA GTTAPRPPES PVSPVYGLPT QQASPPSGAP TMHATSPSYG TPAQPPAGAP
TMQAMPPGTL PPQPAAPSVP QAPAKPKKSG GMGLIVGGIA AVVLLALVAL FALRPATDGQ
NRQVDAALAQ AHELFNQRGK LDQAIEAYQE VLRIDSANAE ARTRLALIYQ MRARYSDAEA
EARAAIDADN RAILAHAVLA ESLHSQGRYN EALDAADRAV AADPDHPTGY GSRAIIKAAR
ALDDADATML AEAIDDAEIA LEKAAGRDNL IQALAHNARG VVYWYQYLFS NDAAMVARGG
DEFNRAIGLQ GQIAVFHSNL GYFYNDQGAS ALQRGNRQEA VSLLDLARQQ FERAQDIDPV
NGHAHAGLGW NLYFLEDYTG AVAEFDKAIE LNPQDTDAHI GKSYALLGLS PPDFDGAIAT
LEQATTVAPY VPELFARLGW THLSKAFAAE SGSAAQTALF QRAEDRFREA LDRNDRFVNA
ITGLGWAQSA LGQYDQALDT LQRSLAIKED QGDAHFGIGW TYYNMGRFTD AESSFRRAIE
IQPLDGSNYY WLGLTLEQLG RVEEAKQAYR TAVEKGNSFA QQELERLGQ