Gene Rcas_0572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0572 
Symbol 
ID5538035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp763756 
End bp766686 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content63% 
IMG OID640892733 
Productexcinuclease ABC, A subunit 
Protein accessionYP_001430719 
Protein GI156740590 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.114911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.199492 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCTG ACTGGATCGT GGTGCGCGGA GCGCGCGTCC ACAATCTTAA GAATATCACG 
GTCGCCATGC CGCGCAATGC GCTGGTGGTG ATCACGGGCC TCTCCGGCTC CGGTAAGTCG
TCGCTGGCAT TCGACACCAT TTTTGCCGAG GGGCAGCGTC GCTATGTCGA GTCGCTCTCC
GTCTATGCGC GCCAGTTCCT CGGTCAGATC GATAAGCCGG ATGTCGATGC GATTGAAGGG
TTGTCGCCTG CGATTGCCAT CGACCAGAAG GGTTTGGTGC GCAATCCGCG CTCGACGGTC
GGCACGGTCA CCGAAATTTA CGATTACCTT CGCTTGCTCT TCGCCCGGAT TGGACGACCG
CACTGCGTTC ACTGCGGTCG TCCGTTGATC CGCCAGTCGG CGCAGCAGAT GATCGATACG
ATCCTCGATC TGCCTCCCGG CAGTCGCATT CTGCTGCTGG CGCCGCTCGT GCGCGATCAG
AAGGGCGACC ATCAGCCGCT CCTCGATCAG GTGCGCAAAC AGGGGTTTGT GCGCGTGCGC
GTCGATGGCG AGGTGCGCGA CCTGGCGGAC GATCTGCGCC TGGATCGCTA CCGCCCCCAT
ACCATCGAGG TGGTCGTTGA TCGTCTGGTC ATACCGCAAT CCGATACAGC GCCGCATCAA
TCCCAGTTGC GGGTGCGCGT CGCCGATTCG GTCGAAATGG CGCTGCGGGT TGGTGATGGG
GTGGTGATCG TGCAGATCGT TGGCGGCGAT GAACTGCCCC TCTCGCAACG GTATGCCTGC
CCGGTGCATG GTCCTGCAAC GATTGGGGCG CTCGAACCGC GCGATTTTTC GTTCAATAAT
CCGTCCGGCG CCTGCGCGAC GTGCGACGGT CTCGGCAGTG TGCTGGAGTT CGATCCCGAT
CTGGTCATTC CAGACCGCTC ACGTTCGCTG GCGGACGGCG CCATCGCGCC ATGGGCGAAT
GTCAGTCGCG CACAGCGCCG CTACTTCGAC GATCTGCTGG CATCGCTCGC CGATCACCTG
GGTTTTTCGC CGCACACGCC GCTGCGCGAT CTTCCTCCCG AAGTGATTGC GACAATCCTC
TACGGCTCTA ACGGTGATGT GATGCCGTTG CGCTACCGGC TGCGCGGCGA GGAGCGTCTC
GTCGAGGCGC CATTCGAGGG CGTAATTCCG GCGTTGCGCC GACGGCTGGG GGAGTGCTCC
GATGAAACGG AACGCGCGCA GATCGAGCAG TTCATGACGC CGTGCGTGTG TCCGGCATGC
AACGGCGCGC GCCTGCGCCC CGAATTGCTC GCCGTCACCG TCGCCGGATA CACGATTGCG
CAGGTGTCGG CGCTGCCCGT CGCTGAAGCA TGGTCGTGGG CGAAAACGCT GGCTGCCGAC
GTCGCAGCGG CCGTCTCCTG CTGGCGCGAG ACGCGCGAAA GCAATCTGCG CTCGTCAATC
TATGCGCTGA ATGTGCGCGA ATGTCAGATT GCAGCGCCCA TCCTGAACGA CATCTGCGCG
CGGCTCCGAT TCCTGAACGA GGTTGGGCTG GAGTATCTCG CGCTGGATCG CGCCGCCGCG
ACCCTCTCCG GCGGAGAAGC GCAGCGTATC CGCCTTGCGA CACAGATCGG GTCCGGGTTG
AGCGGCGCGC TCTACGTGCT GGACGAGCCG AGCATTGGGC TCCACCCGCG TGATACGGCG
CGCCTGCTCA ATACGCTGCG ACGGCTGCGC GACCTGGGGA ATAGTGTGCT GATCGTCGAA
CACGACGAGG AAATCATCCG CGCCGCCGAC TGGATCGTCG ATATTGGTCC TGGCGCAGGG
GAGCGCGGCG GCGAGGTGAT CGTCAGCGGA CCGTTCGAGG CAGTGCTGGC AGAGCCGCGC
TCGCTAACCG GGCAGTATCT CTCCGGCAAA CGCGCGATTC CTGTGCCGCG CCGACGGCGC
TCCGGCAGCG GCAGGTTTTT GATGATCAAA GGGGCGCGTG AGCACAATCT GAAGCATATC
GATGTCGCCA TTCCACTAGG ATGCCTGGTT GCCATCACCG GTGTCAGCGG CTCCGGTAAA
TCCACCCTGG TCAACGACAC CCTCTACCCG CGACTGGCGC AGGCGCTCCA TGGCGCGCGC
GCGCGCCCCG GCGCCCACGA CGCGATCTAC GGCATTGAAC ATATCGATAA GGTGATCGAC
ATCGACCAGT CGCCGATCGG TCGCACGCCG CGTTCCAATC CGGTCACCTA CACCAAAGCC
TTTGACCCGA TCCGCAAGTT GTTTGCGCAA ACCCCCGAAG CGCGCGCGCG CGGCTATGAC
GCCGGTCGTT TTTCGTTCAA CATTCCCGGC GGGCGCTGCG AACATTGCAA CGGCGAAGGG
TTGATGCAGA TCGAGATGCA GTTCCTGCCG GACCTCTACG TGACCTGCGA TGTGTGCCAT
GGCGCGCGCT ACAACCGTGA GACGCTTGAC ATCCGCTATC GCGGCAAAAA TATTGCTCAG
GTGCTCGATA TGACCGCTGA GGAAGCGGCG GCGTTCTTCG AGCGCGTGCC TGCCATTGCC
GAAAAATTGC AGACGTTGAT CGACGTGGGG TTGGGCTACA TTCGCCTCGG TCAACCGGCA
ACCACGCTGT CCGGCGGCGA AGCGCAGCGC ATCAAACTGG CGACTGAACT GAGCCGCCGC
GCCACCGGAC GCACCCTCTA CATCCTGGAC GAGCCAACCA CCGGATTACA CGTCGCCGAC
GTCGACCGGC TGCTGCGTGT GTTGCAGCGG TTGGTCGATG CGGGCAACAC TGTGCTGGTC
ATTGAACATA ACCTCGACGT TATCAAGTGC GCCGACTGGG TCATCGACCT TGGTCCCGAA
GGCGGCGATG CTGGCGGGCG CGTCGTCGCC GCCGGAACTC CCGAACAGGT GGCGCGAACG
CCAGGATCGC ACACCGGTCA GTGTCTGGCG CGCATACTCG TTGAACGTTG A
 
Protein sequence
MSADWIVVRG ARVHNLKNIT VAMPRNALVV ITGLSGSGKS SLAFDTIFAE GQRRYVESLS 
VYARQFLGQI DKPDVDAIEG LSPAIAIDQK GLVRNPRSTV GTVTEIYDYL RLLFARIGRP
HCVHCGRPLI RQSAQQMIDT ILDLPPGSRI LLLAPLVRDQ KGDHQPLLDQ VRKQGFVRVR
VDGEVRDLAD DLRLDRYRPH TIEVVVDRLV IPQSDTAPHQ SQLRVRVADS VEMALRVGDG
VVIVQIVGGD ELPLSQRYAC PVHGPATIGA LEPRDFSFNN PSGACATCDG LGSVLEFDPD
LVIPDRSRSL ADGAIAPWAN VSRAQRRYFD DLLASLADHL GFSPHTPLRD LPPEVIATIL
YGSNGDVMPL RYRLRGEERL VEAPFEGVIP ALRRRLGECS DETERAQIEQ FMTPCVCPAC
NGARLRPELL AVTVAGYTIA QVSALPVAEA WSWAKTLAAD VAAAVSCWRE TRESNLRSSI
YALNVRECQI AAPILNDICA RLRFLNEVGL EYLALDRAAA TLSGGEAQRI RLATQIGSGL
SGALYVLDEP SIGLHPRDTA RLLNTLRRLR DLGNSVLIVE HDEEIIRAAD WIVDIGPGAG
ERGGEVIVSG PFEAVLAEPR SLTGQYLSGK RAIPVPRRRR SGSGRFLMIK GAREHNLKHI
DVAIPLGCLV AITGVSGSGK STLVNDTLYP RLAQALHGAR ARPGAHDAIY GIEHIDKVID
IDQSPIGRTP RSNPVTYTKA FDPIRKLFAQ TPEARARGYD AGRFSFNIPG GRCEHCNGEG
LMQIEMQFLP DLYVTCDVCH GARYNRETLD IRYRGKNIAQ VLDMTAEEAA AFFERVPAIA
EKLQTLIDVG LGYIRLGQPA TTLSGGEAQR IKLATELSRR ATGRTLYILD EPTTGLHVAD
VDRLLRVLQR LVDAGNTVLV IEHNLDVIKC ADWVIDLGPE GGDAGGRVVA AGTPEQVART
PGSHTGQCLA RILVER