Gene Rcas_3864 details


Gene Information

Locus tag: Rcas_3864
Symbol:
ID: 5541368
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5050359
End bp: 5052335
Gene Length: 1977 bp
Protein Length: 658 aa
Translation table: 11
GC content: 59%
IMG OID: 640895973
Product: hypothetical protein
Protein accession: YP_001433918
Protein GI: 156743789
COG category:
COG ID:
TIGRFAM ID:
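
The coordinate and length fields above agree with one another, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(5050359, 5052335)
print(length)                  # 1977
print(protein_length(length))  # 658
```

Both results match the Gene Length and Protein Length fields of this record.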


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0913567
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.365834
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGATC TTCCGATTCG CCGTATGCTG GTGTACCGGC ATGGCGTTGG TTACTTCGAG 
CGACGGGGTC CAATCACCGG TACAGAACTG CGTCTGACGT TCCCACGGGA AGCCATGGAC
GACATTCTGA AAAGCCTGAT CGTCCTCGAT CTGGGTGAAG GTCAGGTTTT GGGTGTCGAT
ATCGAAACGC CGGAGGATCG CGCAAAGCAG ATCGAGCGCG GCTCCATCCA CCTTTCCAAT
ACGCGCAGTC TGCTCGATCT GCTGCGGGAT TTGCGCGGAC GCAACGTGCG CCTTCAGATT
GAGCACGAGC GCCGCCATGG TGATGCCGCC GATGAGGTAA TCGAAGGCGC AATCATCGGT
ATTGATCTCG ATGAGCAAGA ACCGCTCGAC AAACCGCTGC TTTCGCTCTA CCTGTCGAAA
CAGCGCAACG TACGCACCAT ACCGGTGCGG CGCATCGCGC ACCTGGTAAT TCTCGATGAC
CGTGCGGCAG CCGATATGGC GTATTTCCTG CGCGCGGCAC AGAGCGAGGA AGATCGACGA
TCCGCCATTG TGCGACTGTC GGAGGGCGAC CACGACATGC TGGTGGGATA CATCGCTCCT
GCCCCCTCCT GGCGCGTCAG TTATCGCCTG CTGGCGGAGC CGAAACCCGA CGGCAACGAC
TCCGCCAATG GCGGTGGACG TAGCGCAGGT GCGCAAGTGG CGGTTCTCCT TCAGGGGTGG
GGACTGTTCG ATAATCAACT CGACGAAGAC CTCGAGAGCA TCGAGTTGAC GCTGGTCGCC
GGCATGCCAG TGTCGTTCCG CTATCGCCTG TATGAACCCC ACACGCCTGA ACGACCCATG
GTACAGGACG ATGTGCGCAC GGTGGCGGCG CCGATTGAGT TCCAGGCAAA CCGAGCGATA
CCAAGCTTAA TGGAAGTCGC TCCGGATTTG GATGAATTTG CACTCGGCGA GGCGTCCGCA
CTCAGGATGG AGAATCTCGA ACAGTCCATT GAAGCCGCCG GCGTCGGTGA AGAACGTGGC
GCTCTCTTTC AGTACCGTGT TGTGCATCCT GTCAGTGTGG CGCGGGGACG ATCCGCCATG
GTTCCGATTG TCAGCCGACG CCTCGATGGA CGCAAAGAAT TGCTCTACAA CGGTCGCAAA
CTGCCTCGCC ACCCGGTTGC AAGCCTGCGC ATGCAAAACG AAACCGGGCT GACGCTCGAA
CGCGGACCGG TGACGGTCGT CGAGCATGGC GACTACGCCG GTGAAGCCGT GTTGCCCTTC
ACGCGCGCAG GGGCTGAAAT GATTATCGCG TATGCGGTCG AACTTGGGGT GACGATCAGC
GAAGAACGCC ATCATCAGCG CACAATGGCG GGGTTGAGCA TCCACAAAGA GTATGCGGTG
TTTGAGGAAT GGGATGTCCA GCAGATGCGC TACCGCATCA CCAGCACCCT GCCCGACGCC
GTAAACATTG TGATCGAACA GGAACGGTTG AAGGGCTACG ACCTGTTCGA CACACCCGCT
CCAGACGAAG AGGCGCACAA TGTCGCGCGC TGGACAGTGC GGTGCCCGTC TGGCGTCGAA
ACCGTTTTCA TGGTCAACGA ACGCTGCAAG AGATCACGCC ACGAGGAGGT GCGCAAACTC
GATATGCACC GGTTGCAGTC GTTCCTGAGC GACAGATATC TCGATCAGGC GACCTACCGA
GCGCTCGAGC GCATTCTGTC GCTGTATGAT CAGGTTGCGA AGCGCCGTGC AACGCTTCAA
GAGATTGCCC AGGAGCAGCA GAAAATCCTG GCGCGCCAGC AGCAGATCCA GGCAAACCTG
GGACCGTTGG GGCGCGAGGG GAGTGAACGG GCATTGCGCG AACGGTATGT CGCGCAGCTC
AATCAACTCG AAGATCGCCT GAATGACCAG CTTGCCCGCG AGCAGGAGAC CCGCAAGGCA
ATCGAGCGAC TGGAGCAGGA AGCAGCACAG GCGCTTGCAG CATTGTCGAA GCCATAA
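
The listing above groups bases into blocks of ten, so the whitespace has to be stripped before any computation. The sketch below shows that cleanup together with the G+C fraction that underlies a "GC content" figure. The helper names are hypothetical, and how the source page rounds its percentage is not stated, so no value for the full gene is asserted here.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of positions that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return gc / len(seq)


# Three ten-base blocks copied from the start of the listing.
listing = """
ATGCCCGATC TTCCGATTCG
CCGTATGCTG
"""
seq = clean_sequence(listing)
print(len(seq))                             # 30

# Toy input chosen so the result is easy to verify by hand (6 of 8 are G or C).
print(f"{gc_fraction('ATGCGGCC'):.0%}")     # 75%
```

Passing the full listing through `clean_sequence` should yield a string whose length equals the Gene Length field.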
 
Protein sequence
MPDLPIRRML VYRHGVGYFE RRGPITGTEL RLTFPREAMD DILKSLIVLD LGEGQVLGVD 
IETPEDRAKQ IERGSIHLSN TRSLLDLLRD LRGRNVRLQI EHERRHGDAA DEVIEGAIIG
IDLDEQEPLD KPLLSLYLSK QRNVRTIPVR RIAHLVILDD RAAADMAYFL RAAQSEEDRR
SAIVRLSEGD HDMLVGYIAP APSWRVSYRL LAEPKPDGND SANGGGRSAG AQVAVLLQGW
GLFDNQLDED LESIELTLVA GMPVSFRYRL YEPHTPERPM VQDDVRTVAA PIEFQANRAI
PSLMEVAPDL DEFALGEASA LRMENLEQSI EAAGVGEERG ALFQYRVVHP VSVARGRSAM
VPIVSRRLDG RKELLYNGRK LPRHPVASLR MQNETGLTLE RGPVTVVEHG DYAGEAVLPF
TRAGAEMIIA YAVELGVTIS EERHHQRTMA GLSIHKEYAV FEEWDVQQMR YRITSTLPDA
VNIVIEQERL KGYDLFDTPA PDEEAHNVAR WTVRCPSGVE TVFMVNERCK RSRHEEVRKL
DMHRLQSFLS DRYLDQATYR ALERILSLYD QVAKRRATLQ EIAQEQQKIL ARQQQIQANL
GPLGREGSER ALRERYVAQL NQLEDRLNDQ LAREQETRKA IERLEQEAAQ ALAALSKP
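
The protein sequence above can be related to the gene sequence by translating codon by codon. The record names translation table 11, whose amino-acid assignments coincide with the standard code (the two differ in which codons may act as starts; this gene begins with ATG, so that difference does not arise here). What follows is a minimal sketch rather than the database's own procedure: the table is hard-coded in the usual TCAG order, the `translate` name is hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Whitespace is ignored; a codon containing a non-ACGT base raises KeyError.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listing in this record.
first_line = "ATGCCCGATC TTCCGATTCG CCGTATGCTG GTGTACCGGC ATGGCGTTGG TTACTTCGAG"
print(translate(first_line))  # MPDLPIRRMLVYRHGVGYFE
```

The 20 residues printed match the first two blocks of the protein sequence listing.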