Gene RoseRS_3848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3848 
Symbol 
ID5210830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4806619 
End bp4809546 
Gene Length2928 bp 
Protein Length975 aa 
Translation table11 
GC content63% 
IMG OID640597444 
Productexcinuclease ABC, A subunit 
Protein accessionYP_001278152 
Protein GI148657947 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00424286 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000162445 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTCTGCTG ATCGGATCGT GGTGCGCGGG GCGCGCGTCC ATAATCTTAA GAATATCACG 
GTCGCCATGC CGCGCAATGC GCTGGTGGTG ATCACCGGTC TCTCCGGCTC CGGCAAGTCG
TCGCTGGCGT TCGATACGAT TTTTGCCGAA GGTCAGCGTC GCTATGTCGA ATCGCTTTCC
GCCTATGCCC GCCAGTTCCT TGGTCAGATC GACAAGCCGG ACGTTGACGC CATCGAAGGA
TTGTCGCCTG CCATTGCCAT CGATCAGAAA GGGCTGGCGC GCAATCCGCG CTCAACCGTC
GGCACCATTA CCGAAATCTA CGATTATCTG CGACTGCTGT TTGCCCGGAT TGGACGACCG
CACTGCATCC ACTGCGGTCG TCCGCTCACG CGCCAGTCGG CGCAACAGAT GATCGATACG
ATCCTCGGTC TGCCGGGTGG CAGCCGCGTG CTGCTGCTTG CGCCCCTCGT TCGCGATCAG
AAGGGCGACC ATCAGACCCT CTTCGATCAG GTGCGCAAGC AGGGGTTCGT CCGTGTGCGC
GTCGATGGCG AGGTGCGCGA TCTTGGCGAC GACCTGCGCC TTGACCGTCA TCGTCCGCAT
ACGATCGAGG TCGTTGTGGA CCGGCTGGTC ATTCCATCAG CCGATCCCTC GCAGCAACAG
ACGCAGTTTC GCGTGCGCGT CGCCGATTCG GTCGAAACGG CGCTGCGGGT CGGCGGCGGC
GTGGTGATCG TCCAGATCGT CGGCGGTGAT GAACTCACTC TGTCGCAGCG TTACGCCTGC
CCGGTGCACG GACCGGCTTC CATCGGGGCG CTCGAACCGC GCGATTTTTC ATTCAACAAT
CCATCGGGCG CCTGCCCTGC CTGTGATGGT CTGGGAAGCG TGCTGGAGTT CGACCCTGAA
CTGGTGATCC CCGACCGATC CCGTTCACTT GCGGAAGGCG CCGTCGCGCC CTGGTCGAAC
GTCAGTCGCG CGCAGCGCCG CTACTTCGAC GATCTGCTGA CATCACTCGC CGGGCACCTC
GGTTTTTCAC TGCACACACC AGTGCGCGAT CTGCGCCCGG AGGTGATCGC TACGCTGCTC
TACGGTTCTA ACGGCGATGT GATGCCACTC CGTTACCACA TACGCGGCGA AGAGCGCATG
GTTGAGGCGC CGTTCGAGGG GGTTATCCCC GGTCTGCGCC GTCGTCTGGC GGAATGCACC
GATGAAACCG AACGCGCACA GATTGAGCAG TTTATGACCC CGCGCACATG CCCGGCATGC
AACGGCGCGC GACTGCGTCC CGAACTGCTC GCCGTCACCG TCGCCGGATA CACGATTGCG
CAGGTGGCGG CGCTCCCTGT CGCTGAAGCG TGGTCGTGGG CGAAAACACT GGCTGCCGAC
GTTGACGAGG TGGTCGCCCG CTGGCGTGAG ACGCGCGAAA GCGACCTGCG CTCGTCCATC
TATGCTCTTA CGGTGCGCGA ATGTCAGATC GCGGCGCCAA TCCTGAATGA CATCTGCGCC
CGACTCCGGT TTCTAAATGA AGTCGGGCTG GGGTATCTCA CGCTTGACCG CACTGCGACG
ACCCTTGCTG GCGGCGAGGC GCAGCGCATT CGCCTGGCGA CGCAGATCGG CGCCGGGCTG
AGCGGTGCGC TCTATGTGCT GGACGAGCCG AGCATCGGGT TACACCCACG TGATACGGCG
CGCCTGCTCA ACACGTTGCG TCAACTGCGC GACCTGGGCA ACAGCGTGCT GGTTGTTGAA
CACGACGAAG AGATCATTCG CGCAGCCGAC TGGATCGTTG ACATTGGTCC CGGTGCGGGA
GAGCGCGGCG GCGAGGTGAT CGTCAGCGGT CCGCTCGATG CAGTGCTGGC AGAGCCGCGC
TCACTTACCG GACAGTACCT CTCCGGCAAA CGCACGATTG CCGTGCCACG CCGCCGACGA
CCCGGCAACG GCGCATTCCT GATGATCAGG GGAGCGCGCG AGCACAACCT GAAGAACATC
GACGTCGCCA TCCCGCTGGG ATGTCTGGTG GCAGTCACCG GCGTCAGCGG GTCTGGCAAA
TCGACCCTGA TCAACGACAC CCTCTACCCG CGGCTGGCGC AGGCGCTCCA TGGCGCGCGC
GCGCGCCCCG GCGCCCACGA CGCAATCTAT GGCATCGAAC AGATCGATAA GGTGATCGAC
ATCGACCAGT CGCCCATCGG GCGCACGCCA CGCTCCAACC CGGTCACCTA CACCAAGGCG
TTCGACCCGA TCCGCAAATT GTTCGCGCAA ACGCCTGAAG CGCGCGCGCG CGGCTATGAT
GCCAGTCGTT TCTCGTTCAA CATTCCCGGA GGACGGTGCG AGCACTGCAA CGGCGAAGGA
TTGATGCAGA TCGAAATGCA GTTCCTTCCC GATCTGTACG TCACCTGCGA TGTCTGTCAC
GGCGCGCGCT ACAACCGTGA AACGCTCGAC ATTCGTTACC GGGGCAAGAA CATCGCCGAA
GTGCTCGACA TGACGGTTGA AGAAGCGGCG GCATTTTTCG AGCGTGTCCC ATCCATCGCC
GAAAAGTTGC AAACCCTGAT CGACGTCGGG TTAGGGTATA TTCGCCTGGG TCAACCTGCG
ACCACCCTCT CCGGCGGTGA GGCGCAACGC ATCAAACTGG CGACCGAACT GAGTCGGCGC
GCCACCGGGC GCACCCTCTA CATCCTCGAT GAACCGACGA CCGGCCTGCA CGTCGCTGAT
GTTGACCGGT TGCTGCGCGT CTTGCAGCGC CTGGTGGACG CGGGAAACAC CGTTCTGGTG
ATCGAACACC ATCTCGATGT GATCAAATGC GCCGATTGGG TGATCGACCT GGGACCGGAG
GGAGGCGAGG AGGGCGGGCG TGTTGTCGCC GTCGGCACGC CTGAACAGGT CGCGCGAACG
CCAGGATCGT ACACCGGTCA GTGCCTGGCG CGGGTGGTTG AAGGTTGA
 
Protein sequence
MSADRIVVRG ARVHNLKNIT VAMPRNALVV ITGLSGSGKS SLAFDTIFAE GQRRYVESLS 
AYARQFLGQI DKPDVDAIEG LSPAIAIDQK GLARNPRSTV GTITEIYDYL RLLFARIGRP
HCIHCGRPLT RQSAQQMIDT ILGLPGGSRV LLLAPLVRDQ KGDHQTLFDQ VRKQGFVRVR
VDGEVRDLGD DLRLDRHRPH TIEVVVDRLV IPSADPSQQQ TQFRVRVADS VETALRVGGG
VVIVQIVGGD ELTLSQRYAC PVHGPASIGA LEPRDFSFNN PSGACPACDG LGSVLEFDPE
LVIPDRSRSL AEGAVAPWSN VSRAQRRYFD DLLTSLAGHL GFSLHTPVRD LRPEVIATLL
YGSNGDVMPL RYHIRGEERM VEAPFEGVIP GLRRRLAECT DETERAQIEQ FMTPRTCPAC
NGARLRPELL AVTVAGYTIA QVAALPVAEA WSWAKTLAAD VDEVVARWRE TRESDLRSSI
YALTVRECQI AAPILNDICA RLRFLNEVGL GYLTLDRTAT TLAGGEAQRI RLATQIGAGL
SGALYVLDEP SIGLHPRDTA RLLNTLRQLR DLGNSVLVVE HDEEIIRAAD WIVDIGPGAG
ERGGEVIVSG PLDAVLAEPR SLTGQYLSGK RTIAVPRRRR PGNGAFLMIR GAREHNLKNI
DVAIPLGCLV AVTGVSGSGK STLINDTLYP RLAQALHGAR ARPGAHDAIY GIEQIDKVID
IDQSPIGRTP RSNPVTYTKA FDPIRKLFAQ TPEARARGYD ASRFSFNIPG GRCEHCNGEG
LMQIEMQFLP DLYVTCDVCH GARYNRETLD IRYRGKNIAE VLDMTVEEAA AFFERVPSIA
EKLQTLIDVG LGYIRLGQPA TTLSGGEAQR IKLATELSRR ATGRTLYILD EPTTGLHVAD
VDRLLRVLQR LVDAGNTVLV IEHHLDVIKC ADWVIDLGPE GGEEGGRVVA VGTPEQVART
PGSYTGQCLA RVVEG