Gene Rcas_0255 details

Gene Information

Locus tag: Rcas_0255
Symbol:
ID: 5537717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 312484
End bp: 313557
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 60%
IMG OID: 640892419
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_001430406
Protein GI: 156740277
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
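
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; neither assumption is stated in the record, but both are consistent with the values it lists. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 312484
end_bp = 313557

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1074
print(protein_length)  # 357
```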


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000605815
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.153508
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGAG CGCATGTTGG TGTGCGCCAG CGTATCGAAG CGGATGAGCG CGCGCGCCTT 
TCGCCTCTGG CGGCGTTCAG CGATAGCGCC CACCGCGCGC GCCCCGAAGA GGCGTCGCCG
GTGCGGAGCA GTTTTCAGCG TGATCGCGAC CGCATTCTGC ACTCGAAACC GTTTCGCCGC
CTGAAACATA AAACGCAGGT GTTCATTTCA CCGCAGGGGG ATCACTACCG CACCCGCCTC
ACCCATACGC TTGAAGTGAC GCAGATTGCG CGCACGGTCG CGCGCGCGCT GCGCCTCAAC
GAAGACCTGG TCGAGGCGAT TGGTCTGGGG CATGATCTCG GGCATACCCC CTTCGGTCAC
GCAGGTGAGA CGGCACTGTC GCACGCGATG GGGCGCGCCT TCCGTCATAA CGAGCAGAGT
CTACGCATTG TCGATGTGCT GGAACGGAAC GGCGCCGGGT TGAATCTGAC CGATCAGGTA
CGCGAGGGCA TTTATATGCA CTCAAAAGCC CGGCGCGACA TCACCATGCG CGCCTGGGGC
ACAGCCAGCA CGCTCGAAGG GCAGATCGTC AAACTGTGCG ATGCCATCGC GTATATCAAT
CACGACATCG ATGATGCGAT CCGTGGCGGG TTGTTGCACC CCGACGATCT GCCGCGCGAC
GCTATCGAGG TCCTCGGCGA AACGCACGGT CAGCGCCTCG ATACGATGGT CTGCGACCTG
GTCGACCATA ACTGGTGGGC GACCGGCGAA GCGCCGCCAC CCGACCCACC GGAATTGTCG
ATGAGTCCAG ATGTGCTGAA GGCGACCAAT ACGCTGCGAG AGTTTCTGTA TCAGCGCGTG
TACCTCGGAT CGAGAGCCAA AGCGGACGAC AGTAAGGTAT ACTTGATGAT CGAGTTGCTG
TACCATCATT TTCTGAAACA TCCAGAGCAG CTTCCCACAG ATCTGCTGCG GATCAATCAG
GAACGGAACG AACCAATCGA ACGCGCCGTC GTGGACTATA TTGCCGGTAT GACCGACCGC
TTTGCGTTGA AGGTGTTCAA CGACCTGTAT GTACCGCGCA CCTGGAGTGC ATGA
 
Protein sequence
MTRAHVGVRQ RIEADERARL SPLAAFSDSA HRARPEEASP VRSSFQRDRD RILHSKPFRR 
LKHKTQVFIS PQGDHYRTRL THTLEVTQIA RTVARALRLN EDLVEAIGLG HDLGHTPFGH
AGETALSHAM GRAFRHNEQS LRIVDVLERN GAGLNLTDQV REGIYMHSKA RRDITMRAWG
TASTLEGQIV KLCDAIAYIN HDIDDAIRGG LLHPDDLPRD AIEVLGETHG QRLDTMVCDL
VDHNWWATGE APPPDPPELS MSPDVLKATN TLREFLYQRV YLGSRAKADD SKVYLMIELL
YHHFLKHPEQ LPTDLLRINQ ERNEPIERAV VDYIAGMTDR FALKVFNDLY VPRTWSA
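
The protein sequence can be reproduced from the gene sequence using translation table 11. The following is a minimal sketch, not the database's own method: it builds the codon table by hand (table 11 assigns the same amino acids to codons as the standard code and differs in which codons may start translation), strips the spacing used in the sequence blocks, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers. Alternative start codons such as GTG or TTG are not handled; this gene starts with ATG, so that case does not arise here.

```python
from itertools import product

# Codon table in TCAG order (third base varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping and line breaks in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the CDS
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_block = "ATGACCAGAG CGCATGTTGG TGTGCGCCAG CGTATCGAAG CGGATGAGCG CGCGCGCCTT"
last_block = "TTTGCGTTGA AGGTGTTCAA CGACCTGTAT GTACCGCGCA CCTGGAGTGC ATGA"

print(translate(first_block))  # MTRAHVGVRQRIEADERARL
print(translate(last_block))   # FALKVFNDLYVPRTWSA
```

Each gene sequence line holds 60 bases, so every line begins in frame and can be translated on its own, as done for the two lines above. Passing the full gene sequence to `translate` in the same way should yield the full protein sequence.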