Gene Information |
Locus tag | Rcas_3875 |
Symbol | |
ID | 5541380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5067112 |
End bp | 5068611 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640895985 |
Product | RecB family-like nuclease |
Protein accession | YP_001433929 |
Protein GI | 156743800 |
COG category | [R] General function prediction only |
COG ID | [COG2251] Predicted nuclease (RecB family) |
TIGRFAM ID | [TIGR03491] RecB family nuclease, putative, TM0106 family |
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00135732 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0591762 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTGTG AGCGGCGTGT CTGGCTCGAC GGGCGCAGCG CTCAACCGCC GCAGACTGCC CCCACTGCCG ATGCCCGGGC ACGCATGACG CTGCGTGCCA ACCACAAATG CCACATCCTC GCTGCAATTG AAGGAGTCGA AGATTTCTCA GAACTTCCAT GGAACGAACG GGTGACCCAG ACACGCCTCG CAATGCAACG CGGCGCAGCA ATGATAAGCG GCGCCGCGCT CGAAGCGCCA ATCGGCGGTC GCCTGCTACA TGGCGCTCCC GACCTTCTGC AACGCCAGAC TACTGCCGCA TCAGGCGCAT GGATCTACGA GCCGATTGCC ATTGTGCTGC ATACCCGACC CACCCGCTGG GAACGGCTCT TGCTCGACTC GTGGCGCTGG CTCGTCCGGC AAACGCAGGG ATGGGATCAT GACCCGCCGG GTGAACTGTG GCTCGGCGCG AACGGCTATA GACCAGTATG TATCAAACGC CAGACCGCAT CGCTCGCTGC GTTCACTGCA CAGATACGGC GCGCCATTGC GATTGCCGCC GGGGAAGCCC CACCGATCTG GTTCGATAGC GACCATTGCC CGTTTTGTCC ATGGCGCACC TCGTGTGATG CCGCAGCGCA TGACACCCAC GACATCGCCC TGATCCCCAG ACTGAGTCGG CGGCAAGGGA GCGCCCTACG GCGGCTTGGC ATCCATCGCA TCGATCAGGT CATAACGCTC GATCCTGCAA CCATGACGAC ACTGCCGGAC CACTCGTCCG CAACGGAGGC ACGTCTCCGA CGCCAGGCGC AGGCATTGCT GACCGATCAA CCATTACCGG TCAGCGCCGC CATTCCGTCT CTGCCGCAAG TCCGGCTCTT TCTTGACATC GAGAGCGATC CGCAGACGCG CGAGCCGTGG GCATTCGGGC TTGCCGGCGC ACCCGGCGAC CGCTTCGTGA TTGTCGTGGA ACCCCTCGTG GCAGGCAGCG ACGTTCGGCT CAACGGCATC CCTGTCATTG GGGTGCACAG TGCGCATGAG GGTTGGCAGC GCGTGCTCCA CGCCGTGCGC GCGACAGGAG GTGCGCTGGC ACACTGGGGA GAAGCCGAGC GCCTGATGCT CGAGCAGAGT GCGGATCCGC ACACATACCA GGAACTCATC CCCCTCATGA TCGATGCACA ACGCGAACTG TACAAGCGCG TCGTACTTCC GACACCCCGC CAGAGCGACC AGCGCGGCGG CGGATTAAAA GCCGCTGCGC GCTGGCTGGG GTGGAGGTGG TCCCCAGGCG CCGATCACTG GACGCTGGCA TGGGAGGCAT ACCGGCAGTG GCGGGCACAA CCATCGCCAG CCAATGTTTT CGATAGACTG ACGCCGGCAA TCGTCTATCT GGCAACCGAT GTCGAGGCGC TGGCGGCAGT CTGGCGCTGG CTCGACGCTT TTGTGGCATC GATCAATGCA ACCAGCGCGC CTCAAGACGG GCATGCAGAT CGAGATACCG CAACCAACCG CGAATGCTGA
|
Protein sequence | MRCERRVWLD GRSAQPPQTA PTADARARMT LRANHKCHIL AAIEGVEDFS ELPWNERVTQ TRLAMQRGAA MISGAALEAP IGGRLLHGAP DLLQRQTTAA SGAWIYEPIA IVLHTRPTRW ERLLLDSWRW LVRQTQGWDH DPPGELWLGA NGYRPVCIKR QTASLAAFTA QIRRAIAIAA GEAPPIWFDS DHCPFCPWRT SCDAAAHDTH DIALIPRLSR RQGSALRRLG IHRIDQVITL DPATMTTLPD HSSATEARLR RQAQALLTDQ PLPVSAAIPS LPQVRLFLDI ESDPQTREPW AFGLAGAPGD RFVIVVEPLV AGSDVRLNGI PVIGVHSAHE GWQRVLHAVR ATGGALAHWG EAERLMLEQS ADPHTYQELI PLMIDAQREL YKRVVLPTPR QSDQRGGGLK AAARWLGWRW SPGADHWTLA WEAYRQWRAQ PSPANVFDRL TPAIVYLATD VEALAAVWRW LDAFVASINA TSAPQDGHAD RDTATNREC