Gene Rcas_3931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3931 
Symbol 
ID5541437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5128266 
End bp5129984 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content60% 
IMG OID640896039 
Productsingle-stranded-DNA-specific exonuclease RecJ 
Protein accessionYP_001433982 
Protein GI156743853 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.420242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0571898 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACTC CCCGGTTATC CGCGCGAAAC AAACGCTGGC TCATCCACGA AACGCCACGC 
GAGTTTATTG CTGCCTGGCG CCAGTTTCCG CCGCTGATCG CTGCGGTGCT CTATCAGCGT
GGGCTGCGGG ATGAGAGCGC CATCGGGGAG TTCTTCGCTT CTGATTATCG GCTGCACGAT
CCTTTCAGCC TGCGCGGTAT GGAAGCAGCA GTGCAACGGA TCGTCGCGGC TATCCAGAAC
CATGAACCGA TGGCGGTCTA TGGCGATTAT GATACCGATG GCGTAACGGC TGTGACGCTG
CTGGTGCAGG TCATCCGCGC TATGGGCGGA ATGATCCAGC CCTATGTGCC GCACCGTATT
CGTGAAGGGT ATGGTTTGAA CATCGCGGCA ATCGATGCGC TGGTGCGTGA AGGGGTTCGC
CTGCTGATCA CGGTCGATTG TGGCATCAGC AATCTGCGTG AAGCGGAGCA CGCGCGCGCT
GCCGGGCTTG ATCTGATCAT CACCGATCAC CATGCCCCAC CGGCGCGCCT CCCATATGCC
CTGGCAATCG TTAATCCCAA ACAACCGGGG TGCCCGTATC CATTCAAACA TCTGGTCGGC
GTGGGGATCG CCTACCAACT GGCGCGCGCG TTGGTGCGCC GCGGCTTGCG CTCGACGCTC
CAGAAAGATG ATCTGCTCGA TGTCGTCGCC ATTGGTACTG TGACCGATAT GGGACCGCTG
ATCGGCGAGA ATCGCGTGCT GGTGACGCAT GGCTTGAAGG CGATCAATGC CGCGCAACGT
CCCGGAGTGC GCGCGCTGAT CGAAGCCGCC GGTTTGACGC CGGGGAAAGT GACATCGACC
GACATCAGTT TTGGGCTTGG ACCGCGCCTG AATGCGTCGG GACGGATCGA TAACGCGCGA
TTGAGCTACG AACTGTTGCT TGCCGAAGAA CTCGCAATGG CGCAACGCCT GGCGCGCGAA
TTGAATGCGC AGAATCGTCA ACGCCAGGAG TTGTCGAAGA GTGTCCAGGA GCAGGCAAGT
GCGCAAATCA GGGCGTTGGG CAAGCATCAG CAGCGGATTA TCGTGCTCGA CGACGCCGGG
TATCCGTCCG GTGTGGTGGG GTTGGTTGCT GCCCATCTGG TCGAGGAGTA TGGGCGTCCG
ACCGTTCTGA TCGAACGCGG TGAGATGTGG TCGCGCGGCT CAGCGCGTTC AGTTCCCGGT
TTCAGCATCA TCGAAGCGCT GACCGCCTGC GCCGATCTGT TTGAACGCTT TGGCGGGCAT
GCTGAGGCTG CCGGATTCAC CATCGCCACC GACCGCCTGC CGGAGCTTGA AGATCGTCTG
GTGCGGTATG CCGATCAGCG CCTGCCTGAC GATCTGCTGA TCCCCAGCCT TCGGATCGAT
GCCGGGGTTC CGCTTGGCGA GTTGTCATAC GATCTGCTGA ATGAACTGAA GAAGTTGGAA
CCGTTCGGTC AGGGTAATCC ACAACCCGTG CTGATGAGCA GCCGGGTGCA GGTGATCGGC
GCGTGGACGC GCGGGAGTGA GGGGCAGCAC CTGAAATTGC GCCTGACCGA CGCTGGCGGC
AGGGGTCCTT TCAACGCCAT TGCGTTTCGC TTGGGCCACC TGGCGCGCTA TTTCGAGCAA
CCACGCTGGA TCGATGTGGT ATACACTCTC GAAGTCGATG AGTGGAATGG CAGCGATGCG
TTGCAGTTGA ATGTCAAGGA TTTCCGCAGC GCACGATGA
 
Protein sequence
MSTPRLSARN KRWLIHETPR EFIAAWRQFP PLIAAVLYQR GLRDESAIGE FFASDYRLHD 
PFSLRGMEAA VQRIVAAIQN HEPMAVYGDY DTDGVTAVTL LVQVIRAMGG MIQPYVPHRI
REGYGLNIAA IDALVREGVR LLITVDCGIS NLREAEHARA AGLDLIITDH HAPPARLPYA
LAIVNPKQPG CPYPFKHLVG VGIAYQLARA LVRRGLRSTL QKDDLLDVVA IGTVTDMGPL
IGENRVLVTH GLKAINAAQR PGVRALIEAA GLTPGKVTST DISFGLGPRL NASGRIDNAR
LSYELLLAEE LAMAQRLARE LNAQNRQRQE LSKSVQEQAS AQIRALGKHQ QRIIVLDDAG
YPSGVVGLVA AHLVEEYGRP TVLIERGEMW SRGSARSVPG FSIIEALTAC ADLFERFGGH
AEAAGFTIAT DRLPELEDRL VRYADQRLPD DLLIPSLRID AGVPLGELSY DLLNELKKLE
PFGQGNPQPV LMSSRVQVIG AWTRGSEGQH LKLRLTDAGG RGPFNAIAFR LGHLARYFEQ
PRWIDVVYTL EVDEWNGSDA LQLNVKDFRS AR