Gene Rcas_4419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4419 
Symbol 
ID5541932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5676480 
End bp5679611 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content60% 
IMG OID640896517 
Productexcinuclease ABC, A subunit 
Protein accessionYP_001434453 
Protein GI156744324 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.834068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.3832 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAG ATGCTATTGT CATCAAAGGT GCGCGTGAGC ACAATCTGAA AGGGATCGAC 
CTGGAAATCC CGCGTGATAA ACTGGTTGTT CTGACCGGCG TCTCCGGGTC GGGCAAGTCG
TCGCTGGCGT TCGACACGTT GTACGCCGAA GGGCAGCGCC GATACGTCGA GTCGCTCTCG
GCATATGCAC GGCAGTTTCT CGGTCAGATG GAAAAACCGA AAGTCGATGC CATCGAGGGG
TTGTCGCCCG CGATCGCCAT CGAGCAGAAA AGCGCCTCGA AGAATCCACG CTCGACGGTC
GGCACTGTTA CCGAGATTTA TGACTACCTC CGTTTGCTGT ATGCCCGCGT CGGGACGCAG
CACTGTCACG TGTGTGGTCG TCCGGTCAGT TCGCAGAGCG CCGAGCAGAT GGTCAATCGG
GTGCTGACCC TGCCGACAGG GACGCGCTTT ATGGTGCTGG CGCCGCTTGT GTCGCAGCGC
AAGGGCGAGT ATAAAGATGT CTTCGCGGAA GCGCGCGCCG AGGGGTTCGC GCGGGTGCGC
GTCGATGGCG AGATTTTCGA TCTGGCGGGC GAGATCAAAC TCAATAAGAA GGTCAAGCAT
ACGATTGAGA TTGTCATTGA CCGTCTGGCG ATGCCGGCGC GCGACGCCGC GCGTGATCAG
GTGCGCTCTT CTGACGCTCC CATCGGTCGA GCACAGGAAG GTTCCCAGAG TGAGTGGGAC
GCTTTTGTGA CCCGACTGAC CGATAGTGTC GAGCAGGCGC TGCGGGTCGG CGAGGGGCAG
TTGATCATCA GCATTCAGAA CAAAACGGGC GCCGCCGAAG AATGGTTGAT GAGCGAAGCC
AATACTTGCA CGCACTGCGG TATTTCGTTC CCTGAACTCT CGCCGCAAAT GTTCTCGTTC
AACAGTCCGC AGGGCGCCTG CCCCGAATGC ACCGGTCTTG GCGTCCGGCT CGAAGTCGAT
CCGCTCCTGC TCGTTCCCAA CCCATCGCTG ACCCTGCACG AAGGTGCGGT GACGTATTGG
GGCGAACTGC GCAAGAAGCG TGACTCGTGG GGGTACCGCG CGCTGCTGGC GATTGCCCGG
CACTATGGAT TCGATCTCGA TACGCCTTGG GAACAACTCA GCGAACAGGC GCGTCACGTC
ATCATCTATG GCAGCGGGAA AGAGCGGATA CGTTTCCAGT GGGGTGATGA AGGCGGCGAT
AGCCGAGGTG AGTTTACCCG CACCTGGGAA GGGCTGGCAA GCGAGATTCG CCGCCGTTTT
CAGCAGACGG GCAGCGACTA CACACGCGAG TATTACCAGA GTTTTATGAG TGAGCAACCC
TGCCCGGCGT GCAGCGGCGC GCGCCTGCGC CCCGAAAGCC TGGCGGTCCG GGTTGGCGGA
TTGTCAATCC GCGACGTGAC ACGGATGACG ATTGCCGGGG CACTGGCATG GGTGAATGCC
CTGAGCGGCA TTTCCGGCAA CATCGCGCAT CTGGCAGACC TGGAGGGTCA GGTGATGCCC
GGGGTTGTGG CAGGAAATGG CGCTGCGCAT CACGGTGCAG TGACGCCATT GACCGATTAT
CAGATGGCGA TTGTCAACGA TGTGCTGAAA GAAATTCGTG AACGGCTCGG CTTTCTGCTG
AATGTCGGTC TTCATTACCT GACGCTGGAA CGTCCCGCGC CGACGCTCTC CGGCGGTGAG
GCGCAACGCA TCCGTCTCGC ATCGCAGATC GGCTCTGGTC TTGTGGGTGT AACGTACATT
CTCGACGAGC CGAGCATCGG GCTACATCAG CGCGACAATC GCAAACTCCT CGATACGCTG
CTCAAACTGC GCGACCTGGG CAACACCGTC GTGGTGGTTG AGCACGATCT GGAAACCATG
CAGGCGGCTG ACTGGATCAT CGATTTTGGT CCGGGCGCCG GGGTCAAGGG CGGTCAGGTC
GTCGCCGCTG GTCCGCCCAA TGTCGTGGCG GCATCGCCCG AGTCGCTGAC CGGTGCGTAT
CTCGCGGGGC GACTTGAGAT CCCTACGCCG CAGCAGCGCC GCACTGCGCG GGTGCGTCCG
GTTGCCAATG GATTGCAGGA TGCGCCGCGT CGTCGCCGGG TAGATCATCA GTCCGATCTG
GCGGAGGGAC CGTGGCTCGA ACTCGAAGGC GCCACGATGA ACAACCTGCG TGATGTGACT
GTTCGCTTTC CGCTGGGGGT CTTTATGTGC GTCACCGGAG TGTCCGGTTC GGGTAAGTCG
TCACTGATCA CCGAGACGCT CTACCCGGCG CTGGCGAATC GCCTGCATCG CGCTCAGTTG
AAGCCGGGAC CATTCCGCGC GCTGCGCGGG TTGGAGCATC TCGATAAGGT GATCGATATC
GATCAGCAAC CAATCGGGCG GACGCCGCGC TCGAACCCGG CGACGTATGT CAAACTGTTC
GACCTGATTC GTGAACTATT CGCCTCGACT AATGAAGCGA AACTACGTGG CTATAACGCC
GGTCGCTTCT CGTTCAACCT GAAGGGCGGG CGCTGCGAAG CCTGCGAGGG CAATGGCGAA
AAACGCATCG ATATGCAGTT CCTGGCAGAT GTCTGGGTGC GCTGCGATGT CTGCAAGGGC
AAACGGTACA ACCGCGAAAC GTTGCAGGTC AAGTACAAAG GCAAATCAAT CGCCGATGTC
CTTGATATGG ACGTTCAGAC GGCGCTGGAG TTCTTCGACA ATGTGCCGCG TATCAAGCGC
ATGCTCCAGA CGTTGCACGA TGTCGGGCTG GACTACATCA AACTCGGGCA ATCGGCGACG
ACCCTTTCCG GCGGCGAGGC GCAACGGGTG AAACTTGCGA AAGAACTGGC GCGCGTTGCA
ACCGGTCGTA CCATGTATAT TCTCGATGAA CCGACCACCG GGTTGCACTT TGCCGATGTG
CAGCGCCTGC TCACCGTGCT GCACCGTCTT GTGGATGCGG GCAACACCGT GCTCGTCATT
GAACACAACC TCGATGTGAT TAAGACCGCA GACTGGATCA TCGACATGGG ACCGGAGGGC
GGCGACGGCG GCGGCACGGT CGTCGCCGTC GGCACGCCTG AAGAGGTCGC CATGATCGAG
GCATCGCACA CGGGACGGTT CCTGCGCGAG ATTCTATATG CGACTGGGGT TAAGGGTGTG
GCGCAAGATT AA
 
Protein sequence
MAKDAIVIKG AREHNLKGID LEIPRDKLVV LTGVSGSGKS SLAFDTLYAE GQRRYVESLS 
AYARQFLGQM EKPKVDAIEG LSPAIAIEQK SASKNPRSTV GTVTEIYDYL RLLYARVGTQ
HCHVCGRPVS SQSAEQMVNR VLTLPTGTRF MVLAPLVSQR KGEYKDVFAE ARAEGFARVR
VDGEIFDLAG EIKLNKKVKH TIEIVIDRLA MPARDAARDQ VRSSDAPIGR AQEGSQSEWD
AFVTRLTDSV EQALRVGEGQ LIISIQNKTG AAEEWLMSEA NTCTHCGISF PELSPQMFSF
NSPQGACPEC TGLGVRLEVD PLLLVPNPSL TLHEGAVTYW GELRKKRDSW GYRALLAIAR
HYGFDLDTPW EQLSEQARHV IIYGSGKERI RFQWGDEGGD SRGEFTRTWE GLASEIRRRF
QQTGSDYTRE YYQSFMSEQP CPACSGARLR PESLAVRVGG LSIRDVTRMT IAGALAWVNA
LSGISGNIAH LADLEGQVMP GVVAGNGAAH HGAVTPLTDY QMAIVNDVLK EIRERLGFLL
NVGLHYLTLE RPAPTLSGGE AQRIRLASQI GSGLVGVTYI LDEPSIGLHQ RDNRKLLDTL
LKLRDLGNTV VVVEHDLETM QAADWIIDFG PGAGVKGGQV VAAGPPNVVA ASPESLTGAY
LAGRLEIPTP QQRRTARVRP VANGLQDAPR RRRVDHQSDL AEGPWLELEG ATMNNLRDVT
VRFPLGVFMC VTGVSGSGKS SLITETLYPA LANRLHRAQL KPGPFRALRG LEHLDKVIDI
DQQPIGRTPR SNPATYVKLF DLIRELFAST NEAKLRGYNA GRFSFNLKGG RCEACEGNGE
KRIDMQFLAD VWVRCDVCKG KRYNRETLQV KYKGKSIADV LDMDVQTALE FFDNVPRIKR
MLQTLHDVGL DYIKLGQSAT TLSGGEAQRV KLAKELARVA TGRTMYILDE PTTGLHFADV
QRLLTVLHRL VDAGNTVLVI EHNLDVIKTA DWIIDMGPEG GDGGGTVVAV GTPEEVAMIE
ASHTGRFLRE ILYATGVKGV AQD