Gene Rcas_4117 details

Gene Information

Locus tag: Rcas_4117
Symbol:
ID: 5541628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5329318
End bp: 5331321
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 60%
IMG OID: 640896229
Product: serine/threonine protein kinase
Protein accession: YP_001434167
Protein GI: 156744038
COG category: [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
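
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is the protein's codons plus one stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 5329318
end_bp = 5331321
gene_length_bp = 2004
protein_length_aa = 667

# Assumption: coordinates are 1-based and inclusive, so the span
# is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is one codon per residue plus one stop codon.
expected_cds_length = (protein_length_aa + 1) * 3

print(span == gene_length_bp)                 # True
print(expected_cds_length == gene_length_bp)  # True
```

Both checks agree with the listed gene length of 2004 bp.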


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00545764
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGAGCAACC TTCCTGAATG GATCGGTCAT TATCACCTGG AGCGTCAGAT CGGCAAAGGC 
GGCATGAGCC TTGTGTGGCT GGCGCGTCAC CGCACGCTTC GCGCTCGTCA GGTGGCGATC
AAGGTGCTCC TTTCACAGGA ATCCGAGTGG GTCGAGCGCT TCACCCGCGA AGCGGAGATC
ACAAGTCAAC TCCGCCACGC GCACATTGTG CCGATCTATG ATCACGGCTA TCAGGCGCCG
TTCTACTACA CGGTGATGGA GTATGTCGAG GGCGGATCAC TGCGCGATCT GCTCAGTAAG
CGTGGACGGT TGCCGCTCGA CCTGGCACTG CACATCTTTC GTTGCGCTGC GGCTGCCCTC
GACTATGCCC ACGCGCACGG GGTCATCCAT CGTGATATTT CGACCGGAAA TATTCTGGTG
GATCAGGATG GCGCGCGCGT GTTCCTTGCC GACTTCGGCA TTGCCCGCGA GTCGGGCAAA
ACGTCGCTGA CGACGGTCCA CAAAGTGATG GGAACGCGGG GGTTCTTTTC ACCGGAGCAC
ATTGCCTCGG CAACCGCCGT GACTCATCTT TCCGATCTGT ACAGCCTCGG AGTTGTGCTG
TTCGTCATGC TCACCGGCGC CCTGCCCTGG TCCTACATCC CAGGACCCGG CGAGGATGGC
GGACCATTCG TGCCACTGAT GTCGCTGCGC GACCGCGGGG TGACCGGCTT GCCAACCGAA
CTCGACACTG TCATCCATGC CATGCTGGCG CCAGACCCGT CTAAACGGTA TCCCAGCGCT
CAGGCAGCGG TCGAGGCGCT GGAACAGGTG TTACGTCGTC ATACCAGTAC AACGCAGGTG
ATGGCCGGCG CACCCGCCAG CGCGCCACCG GTTCCGGCAG CACCGCCGGA AGAGCCACAT
CCGGTCGAAC AGACGCTGGC GCCTGATCTG ATTAAGGCGC CAATCCAGGA AGCGCTGAAG
CGCGCGCGCG AACTCAATGA CCCGCGCGAA ATTGCCCGCT TGCTGAATGC CTGGAGCAAT
GCGCGACCAT GGCTGCGCCG TCCGTCTTTG GGGCGCATGG CGGCGATCAG GCAGATCGGG
CATCGCAACG TTTACTGGTA TACCCTGCGC GTGCTGTATG AGACGCGCGA ACCGGCGCAG
ACGGCGGAAG AACCGGATCA CCAGATGACG AACCACAAAC TGGAACGCGA ACCCGGTCGC
TGGGAGGTGC CGCTTCCGGC GCCAAAAGGG TTCGAGAACG ATCCCGGCGG CGTGGTGCGC
ATTCCCGGTT CAACGCGCGT TGTGCTCTGT GATGATTGTA AAGGTATTGG GCGGACGATC
TGTCCCCGCT GCAACGGCAA TCGGCGCATT CCGGCGCCAG CCGACCCCAC GCCGCCCGTC
GTGACGACAA GTGCGTCGGT GTCCGCTGAA TCGGCGACGT CCGCCCCGGC ACAGGCTGCG
GCAGCGCCAC GGTTGATCCC GTGCCCTGAT TGCCAGGGAA GCGGCGGATT ACACTGCAAA
CGCTGCGATG GGACCAGCCG CCTGGTCGTT CACAAAACCT TGCGCTGGCA TCGCCGGGCG
GAAAGGATGA CTGCCCGCGA TGATCTACCG CGTATCGATG AAGATTGGTT GGAAAAACGC
TGCAAGAAAC ATACTATCTA TCGTGAGCAG GAGAACGGAG GGTTTCGTCC TGAATGGCGC
CTTGTGCCGA ACATTCAGGC AATGATTCAG GAGGCGGAAG CATGCCTGAA TCCCGATACG
CGTATTCTGT TCAGCGAAGT GACTGTGCGC TTTATTCCCA TCACCGAGAT CGTGTTCGAT
CTCGATGAAT GGAAACCGGC AAAAGCCACC GGACAGAAAG GTCAGTCGCG CGAACCGGTG
CTCTACCGCT GGTATATCTA TGGCTTTGAA AACATCTTGC CTGATGACCG GCGTTTTTTG
AACCGGGATC GGATTATTGC ACTGGGGGCG ACCGGCGTGA GCGTTGCGGC AATCATTGCG
CTGATTTTGC TACTGGTGCT GTGA
 
Protein sequence
MSNLPEWIGH YHLERQIGKG GMSLVWLARH RTLRARQVAI KVLLSQESEW VERFTREAEI 
TSQLRHAHIV PIYDHGYQAP FYYTVMEYVE GGSLRDLLSK RGRLPLDLAL HIFRCAAAAL
DYAHAHGVIH RDISTGNILV DQDGARVFLA DFGIARESGK TSLTTVHKVM GTRGFFSPEH
IASATAVTHL SDLYSLGVVL FVMLTGALPW SYIPGPGEDG GPFVPLMSLR DRGVTGLPTE
LDTVIHAMLA PDPSKRYPSA QAAVEALEQV LRRHTSTTQV MAGAPASAPP VPAAPPEEPH
PVEQTLAPDL IKAPIQEALK RARELNDPRE IARLLNAWSN ARPWLRRPSL GRMAAIRQIG
HRNVYWYTLR VLYETREPAQ TAEEPDHQMT NHKLEREPGR WEVPLPAPKG FENDPGGVVR
IPGSTRVVLC DDCKGIGRTI CPRCNGNRRI PAPADPTPPV VTTSASVSAE SATSAPAQAA
AAPRLIPCPD CQGSGGLHCK RCDGTSRLVV HKTLRWHRRA ERMTARDDLP RIDEDWLEKR
CKKHTIYREQ ENGGFRPEWR LVPNIQAMIQ EAEACLNPDT RILFSEVTVR FIPITEIVFD
LDEWKPAKAT GQKGQSREPV LYRWYIYGFE NILPDDRRFL NRDRIIALGA TGVSVAAIIA
LILLLVL
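
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The sketch below is a minimal illustration, not the database's own method. It builds the codon table from the usual TCAG ordering, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The helper name `translate` is hypothetical. Only the first and last lines of the gene sequence are translated here. Each full line holds 60 bases, so every line starts in frame.

```python
from itertools import product

# Codon table in TCAG order. Assumption: table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence as listed above.
first_line = "ATGAGCAACC TTCCTGAATG GATCGGTCAT TATCACCTGG AGCGTCAGAT CGGCAAAGGC"
last_line = "CTGATTTTGC TACTGGTGCT GTGA"

print(translate(first_line))  # MSNLPEWIGHYHLERQIGKG
print(translate(last_line))   # LILLLVL*
```

The first 20 residues and the final seven residues match the listed protein sequence, and the gene ends with a TGA stop codon, shown as `*`.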