Gene Rcas_3875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3875 
Symbol 
ID5541380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5067112 
End bp5068611 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content64% 
IMG OID640895985 
ProductRecB family-like nuclease 
Protein accessionYP_001433929 
Protein GI156743800 
COG category[R] General function prediction only 
COG ID[COG2251] Predicted nuclease (RecB family) 
TIGRFAM ID[TIGR03491] RecB family nuclease, putative, TM0106 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00135732 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0591762 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTGTG AGCGGCGTGT CTGGCTCGAC GGGCGCAGCG CTCAACCGCC GCAGACTGCC 
CCCACTGCCG ATGCCCGGGC ACGCATGACG CTGCGTGCCA ACCACAAATG CCACATCCTC
GCTGCAATTG AAGGAGTCGA AGATTTCTCA GAACTTCCAT GGAACGAACG GGTGACCCAG
ACACGCCTCG CAATGCAACG CGGCGCAGCA ATGATAAGCG GCGCCGCGCT CGAAGCGCCA
ATCGGCGGTC GCCTGCTACA TGGCGCTCCC GACCTTCTGC AACGCCAGAC TACTGCCGCA
TCAGGCGCAT GGATCTACGA GCCGATTGCC ATTGTGCTGC ATACCCGACC CACCCGCTGG
GAACGGCTCT TGCTCGACTC GTGGCGCTGG CTCGTCCGGC AAACGCAGGG ATGGGATCAT
GACCCGCCGG GTGAACTGTG GCTCGGCGCG AACGGCTATA GACCAGTATG TATCAAACGC
CAGACCGCAT CGCTCGCTGC GTTCACTGCA CAGATACGGC GCGCCATTGC GATTGCCGCC
GGGGAAGCCC CACCGATCTG GTTCGATAGC GACCATTGCC CGTTTTGTCC ATGGCGCACC
TCGTGTGATG CCGCAGCGCA TGACACCCAC GACATCGCCC TGATCCCCAG ACTGAGTCGG
CGGCAAGGGA GCGCCCTACG GCGGCTTGGC ATCCATCGCA TCGATCAGGT CATAACGCTC
GATCCTGCAA CCATGACGAC ACTGCCGGAC CACTCGTCCG CAACGGAGGC ACGTCTCCGA
CGCCAGGCGC AGGCATTGCT GACCGATCAA CCATTACCGG TCAGCGCCGC CATTCCGTCT
CTGCCGCAAG TCCGGCTCTT TCTTGACATC GAGAGCGATC CGCAGACGCG CGAGCCGTGG
GCATTCGGGC TTGCCGGCGC ACCCGGCGAC CGCTTCGTGA TTGTCGTGGA ACCCCTCGTG
GCAGGCAGCG ACGTTCGGCT CAACGGCATC CCTGTCATTG GGGTGCACAG TGCGCATGAG
GGTTGGCAGC GCGTGCTCCA CGCCGTGCGC GCGACAGGAG GTGCGCTGGC ACACTGGGGA
GAAGCCGAGC GCCTGATGCT CGAGCAGAGT GCGGATCCGC ACACATACCA GGAACTCATC
CCCCTCATGA TCGATGCACA ACGCGAACTG TACAAGCGCG TCGTACTTCC GACACCCCGC
CAGAGCGACC AGCGCGGCGG CGGATTAAAA GCCGCTGCGC GCTGGCTGGG GTGGAGGTGG
TCCCCAGGCG CCGATCACTG GACGCTGGCA TGGGAGGCAT ACCGGCAGTG GCGGGCACAA
CCATCGCCAG CCAATGTTTT CGATAGACTG ACGCCGGCAA TCGTCTATCT GGCAACCGAT
GTCGAGGCGC TGGCGGCAGT CTGGCGCTGG CTCGACGCTT TTGTGGCATC GATCAATGCA
ACCAGCGCGC CTCAAGACGG GCATGCAGAT CGAGATACCG CAACCAACCG CGAATGCTGA
 
Protein sequence
MRCERRVWLD GRSAQPPQTA PTADARARMT LRANHKCHIL AAIEGVEDFS ELPWNERVTQ 
TRLAMQRGAA MISGAALEAP IGGRLLHGAP DLLQRQTTAA SGAWIYEPIA IVLHTRPTRW
ERLLLDSWRW LVRQTQGWDH DPPGELWLGA NGYRPVCIKR QTASLAAFTA QIRRAIAIAA
GEAPPIWFDS DHCPFCPWRT SCDAAAHDTH DIALIPRLSR RQGSALRRLG IHRIDQVITL
DPATMTTLPD HSSATEARLR RQAQALLTDQ PLPVSAAIPS LPQVRLFLDI ESDPQTREPW
AFGLAGAPGD RFVIVVEPLV AGSDVRLNGI PVIGVHSAHE GWQRVLHAVR ATGGALAHWG
EAERLMLEQS ADPHTYQELI PLMIDAQREL YKRVVLPTPR QSDQRGGGLK AAARWLGWRW
SPGADHWTLA WEAYRQWRAQ PSPANVFDRL TPAIVYLATD VEALAAVWRW LDAFVASINA
TSAPQDGHAD RDTATNREC