Gene Sare_4131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4131 
Symbol 
ID5708111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4692735 
End bp4696025 
Gene Length3291 bp 
Protein Length1096 aa 
Translation table11 
GC content75% 
IMG OID641273559 
ProductUvrD/REP helicase 
Protein accessionYP_001538912 
Protein GI159039659 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.81155 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.92786 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCGCAG GGCGGATCGA CGACCCCCGA CAGCTGAAGG TGGTCGGACA CACCGCCGGG 
CCGATGCTGG TGCTCGGCGG GCCGGGCACC GGCAAGACCA CGACCCTGGT CGAGGCGGTC
GCGGCGCGGG TGGCCGAAGG CGTGGATCCG GAGCGCGTCC TGGTCCTCAC CTTCAGTCGC
CGGCAGGCTG CCAGCCTGCG TCAGCGCATC GAGGCCCGGA TCGCCGGTGA CGGCCATCGG
GTGCTGCGGG AACCACTGGT ACGCACCTTC CCGGCGTACG CGTTCGGGCT GCTGCGCCGC
GCCGCGGCGG AACGCGGCGA GCCGTCACCC CGGCTGCTCA CCGGTCCCGA GCAGGACCTG
ATCATTCGGG AACTGCTCGA TCTGGTGGGT GAGGAGCCGG ACAACGACCC GGTGGGCTGG
CCGGAGGACC TACGCCCGGC GTTGCGTACC CGGGCCTTCG CGGCCCAGCT GCGGGACCTG
CTGATGCGGG CCGCCGAGCG CGGCATTGGC CCGGTGGAGC TGGCTCGGCT GGGCGAGCGG
CTCGGCCGGG CCGACTGGCC GGCCGCCGCG CGGTTCCTCC GGGAGTACGT CGCCGTGCTC
GCACTGCGCG ATGTCGGTAA CCGCGGCTCG ATCGCGTACG ATCCGGCCGA ACTGGTCCGG
TCCGCCACCG GAATGCTGCT CGACGATCCG GCGCTGCTGG CCGCGGAACG GCGCCGGCTG
GCACACGTCT ACGTGGACGA GTACGCCGAC ACCGACCCTG CCCAGCAGGA CCTGCTGGCG
ACCGTGGCGG GTGACGGTAA GGCCCTGGTC GTCCTCGCCG ACCCCGACTC GTCCACGTAC
GCCTTCCGCG GTGCCGATCC GGGCGGTGTG GTGACCTTCC CGCACCGGTT CCGTACCGCC
TCGGGGGCAC CCGCGGCGCA GGTCGTGTTC ACCACGTCCT ACCGGGCCGC TCCCCGGCTG
CTGGAAGCGC ACGCGCGGTT GGCCCGCCGG CTACGCGGGC CAGCCACACA CCGGCGGCTA
CGTCCGCTGC CCACCGCGGC GACGGGCACG GCGGAGGTGC GCACGTTCCG TTCCGCCGCC
AGCGAGGCGG CCTGGCTGGC ACACGCGCTG CGGGCGGCGC ACCTGCTCGA CGGGGTGCCC
TGGGCTGAGA TGGCCGTGTT GGTGCGCTCC ACCGCGCGAC AGCTGCCCTC GCTGCGACGC
GCCCTGCACG CTGCCGGCGT GCCCACCGTG GTCCACGGCG AGGACCTGCC ACTGCACCTG
CAACCGGCGG TCGCCCCGCT GCTGCTCCTG CTGCGCTGCG CGCTTGCCCC GGACCGGCTC
GACGAGGAGG CGGCCGTCGC CCTGTTGCAC TCGCCTCTCG GCGGGGCGGA TCCGTTGGCC
GAACGGCGGC TGCGGCAGGG CCTGCGGGTC ATGGCGTTGG CCGCCGGCGA CCGCCGACCC
TCCGGCGAGT TGATCACCGA GGCGCTGCGT GACCCGGCCG AGCTGGCCGG CATCGACCGC
CGGTGGGCGG AGCCGGCGCA GACGGTGGCC GGTCTGCTGG CGGTCGCCCG GGAGACGGCG
GCTCGGCCGG GGGTCACCGC CGAGGAGGTG CTCTGGGCGG TGTGGCGGGC CAGCGGCCTG
GCCGAGCTGT GGTCGGCGGC GCTGGCCCGA ACCCCGGCGA CAGCCGGGGA GGGGGATGTC
GCCCGGCGTC GACGCGCCGA GGCGGCCGAC CGGGACCTGG ACGCGGTGAT GGTGCTGTTC
GACGCCGCCG CCCGGTTCAC CGACCGGCTG CCCGGGGCGC GCGTCGAGGT CTTCCTCGAC
CACGTGTTGG GGCAGGACCT GCCGGCCGAC ACCCTCGCCC CGACCGCCGA CCGGGGCGAC
GCGGTCCGGC TGCTCACCGC GCACGCCGCG AAGGGGCTGG AGTGGGATGT GGTCGCCGTC
GCCGGAGTAC AGGAGGGTGT CTGGCCCGAC CTGCGGCTGC GGGGCAGTCT GCTCGGCTCG
GAGCGCCTCG TCGACGTGCT CGCCGGCCGC TCGGCCGGCG CCGGGCCGCG GGCCTCCGTC
GTCGGCCAGA CCTCGGCTCT GCTGGACGAG GAACGGCGAC TCTTCCACGT CGCGGTGACC
CGGGCGCGAC GCCGACTGCT GGTCAGTGCG GTCGCCAGCG CCACCGTCGG TGGTGACGAC
CACGATGAGC AACCGAGCCG GTTCCTCCAT GAACTCGGCG TGGCGGATCC GCCGCCGGCA
GCTGCCGCGG GACCGGACGG GGACGCCGCG TCGGACGAGG CCCGGACGGC CCCGCTGCCG
CTGAGCCGGC CACCCCGGGC GTTGACCCTG GCATCGCTGG TGGCCGAGCT GCGTGCCGCG
GTGACCGACC CGGCCTCACC CGCGCTCCGG CGTCGGGCGG CAGCGGCCGA GCTGGCCCGA
CTCGCCGCCG AAGGAGTCGC CGGAGCGCAC CCGAGCGACT GGTGGGGGCT GCGTCCGCTC
TCCGACGACC GGCCGCTGGT CGCTGACGGA GAGCCGGTGC GGGTCACCCC CTCGGGGATG
GAGAGCGCCC TCCGATGCAG CCTGCGCTGG CTGCTCGAAC GTCACGGGGG TAGCGCGCCG
GCCGGTGCCG CCCAGGGGGT GGGGAACCTG GTGCACGCTG CCGCGATGCT GGCCGAGAGC
GCCAGCTCGG ACCGGCGGGC GCTGCTCGAC TACGTGGCGT CCCGGTTCGA CGCGATCGAG
CTGGCAGCCC GCTGGATGGT CGGGCCGGAG CGAGCCCGAG CCGAGGCGAT GGTGGACAAG
CTGCTGCGCT GGCTGGCCAC CAACCCGCGC CGGCTGCTCG CCATCGAGCA CGAGTTCGCC
GTCCGGCTGA ACGACCCACG GCACCCGGTT GACCTGGTCG GCCGGGTGGA CCGGCTGGAG
GTCGACGCGG ACGGCCGGCT CGTGGTGGTC GACCTGAAGA CCGGCAAGTC GACCGCGGTC
ACCGCGGCTG ACCTCGCGGA GCATCCGCAG CTGGGCGCGT ACCAGGCGGC GGTCGAGGCG
GGAGCGTTCG CCGAGTTCGG GGCGGAGTCC GGGGGAGCCG CCCTGGTGCA GCTCGGCACC
GCAGCCAAGG ACGCCCGGGA ACAGTCCCAG CCGCCGGCCG GTGAAGGGCC GGCGGCCGGC
TGGGCGACCG CCCTGGTCCG GCGTACCGCC GATACGATGG CCGCCGCCAC CTTCGCTGCG
GTCGCCAACT CGAAGTGTCG GGTCTGCCCG GTACGCGCGA GCTGTCCAGT ATCCGGACGG
GGACGTCAGG TCGTCGAACC GCCGAGCATC CGCCCGTCGG AGGAACCGTG A
 
Protein sequence
MAAGRIDDPR QLKVVGHTAG PMLVLGGPGT GKTTTLVEAV AARVAEGVDP ERVLVLTFSR 
RQAASLRQRI EARIAGDGHR VLREPLVRTF PAYAFGLLRR AAAERGEPSP RLLTGPEQDL
IIRELLDLVG EEPDNDPVGW PEDLRPALRT RAFAAQLRDL LMRAAERGIG PVELARLGER
LGRADWPAAA RFLREYVAVL ALRDVGNRGS IAYDPAELVR SATGMLLDDP ALLAAERRRL
AHVYVDEYAD TDPAQQDLLA TVAGDGKALV VLADPDSSTY AFRGADPGGV VTFPHRFRTA
SGAPAAQVVF TTSYRAAPRL LEAHARLARR LRGPATHRRL RPLPTAATGT AEVRTFRSAA
SEAAWLAHAL RAAHLLDGVP WAEMAVLVRS TARQLPSLRR ALHAAGVPTV VHGEDLPLHL
QPAVAPLLLL LRCALAPDRL DEEAAVALLH SPLGGADPLA ERRLRQGLRV MALAAGDRRP
SGELITEALR DPAELAGIDR RWAEPAQTVA GLLAVARETA ARPGVTAEEV LWAVWRASGL
AELWSAALAR TPATAGEGDV ARRRRAEAAD RDLDAVMVLF DAAARFTDRL PGARVEVFLD
HVLGQDLPAD TLAPTADRGD AVRLLTAHAA KGLEWDVVAV AGVQEGVWPD LRLRGSLLGS
ERLVDVLAGR SAGAGPRASV VGQTSALLDE ERRLFHVAVT RARRRLLVSA VASATVGGDD
HDEQPSRFLH ELGVADPPPA AAAGPDGDAA SDEARTAPLP LSRPPRALTL ASLVAELRAA
VTDPASPALR RRAAAAELAR LAAEGVAGAH PSDWWGLRPL SDDRPLVADG EPVRVTPSGM
ESALRCSLRW LLERHGGSAP AGAAQGVGNL VHAAAMLAES ASSDRRALLD YVASRFDAIE
LAARWMVGPE RARAEAMVDK LLRWLATNPR RLLAIEHEFA VRLNDPRHPV DLVGRVDRLE
VDADGRLVVV DLKTGKSTAV TAADLAEHPQ LGAYQAAVEA GAFAEFGAES GGAALVQLGT
AAKDAREQSQ PPAGEGPAAG WATALVRRTA DTMAAATFAA VANSKCRVCP VRASCPVSGR
GRQVVEPPSI RPSEEP