Gene Sare_3366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3366 
Symbol 
ID5707990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3886126 
End bp3888228 
Gene Length2103 bp 
Protein Length700 aa 
Translation table11 
GC content65% 
IMG OID641272792 
Productexcinuclease ABC subunit B 
Protein accessionYP_001538159 
Protein GI159038906 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00385634 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGCTCG ACATTCCCCG GCTCGACGGC CGTTTCCAGG TCGTCAGTGA TTTTCAGCCG 
GCCGGTGACC AGCCGGCCGC AATCGACGAC TTGGAGCGGC GGGTCCGGCG TGGTGACCGG
CACACGGTGT TGCTCGGTGC GACCGGCACC GGCAAGAGCG CCACCACGGC GTGGTTGATT
GAGCGGTTGC AGCGCCCGAC GCTCGTGTTG GCCCCCAACA AGACGCTCTG CGCCCAGCTG
GCCAAGGAAT TCAGCGAGCT ACTGCCGAAC AACGCGGTGG AGTATTTCGT CTCGTACTAC
GACTATTACC AGCCCGAAGC GTACATTCCG CAGACCGACA CCTACATCGA GAAGGACTCC
TCGATCAACG AGGAGGTCGA GCGGCTGCGG CACTCGGCGA CGATGTCACT GCTCACCCGT
CGGGACGTGG TGGTGGTCGC CACCGTCTCG GCGATCTACG GCCTGGGCAC GCCCGAGGAG
TACCTGGACC GTGCCGTCCG CGTAGCGACC GGTCAGGAGT TGGATCGCGA CCAGTTGCTG
CGCCGGCTGG TCGACATTCA GTACACCCGC AACGACATGG CCTTCCAGCG GGGCACGTTC
CGGGTCCGTG GCGACACCTT GGAGATCATT CCCGCGTATG AGGAACTGGC GGTCCGGATC
GAGCTGTTCG GGGACGAGGT GGAGCGGCTC TACTACCTCA ACCCGCTCAC CGGCGATGTG
GTCCGGGAGG TCGACCACCT GCTGATCTTC CCGGCCACGC ACTATGCGGC TGGTCCAGCT
CGAATGGAGC GGGCGATTCG CGATATTGAG ACGGAGCTGG GCGAGCGGTT GGCCGAGTTG
GAACGGCGGA GCAGCCTGCT GGAGGCGCAG CGGCTGCGGA TGCGGACGAC GTACGACATC
GAGATGATGC GGCAGGTCGG TTTCTGCTCC GGCATCGAGA ACTATTCGAT GCACATCGAC
GGGCGGCTGC CCGGCAGCCC GCCACACTGC CTGCTCGACT ACTTTCCGGA TGACTTCCTC
ACTGTGGTCG ACGAGTCGCA CGTGACGATC CCGCAGGTCG GCGGGATGTA CGAGGGCGAC
GCGTCCCGCA AACGGATGCT GATCGACCAC GGCTTCCGAC TGCCCAGCGC CGCCGACAAC
CGACCGCTGC GCTTCGACGA GTTCCTGGAG CGGGTCGGTC AGATGGTCTT CCTCTCGGCC
ACCCCCGGCT CGTGGGAACT GGAGCACGCG CAGGGCGAAT ATGTGGAGCA GGTCATTCGA
CCGACCGGTC TGGTCGACCC GGAGGTGGTG GTCAAGCCGA CCAAGGGTCA GATCGACGAC
CTCATGCACG AGATCAAGCT GCGCACCGAG CGCGACGAGC GGGTGCTGGT CACCACGCTG
ACCAAGAAGA TGGCCGAGGA TCTGTCGGAC TATCTTCTGG AGAACGGTAT CCGGGTGCGT
TATCTGCACT CCGAGGTGGA CACCCTGCGC CGGGTCGAAC TGCTACGCGA GCTGCGTAAG
GGTGACTACG ACGTGCTGGT CGGCATCAAC CTGCTCCGGG AGGGGCTCGA TCTGCCGGAG
GTGTCCCTGG TCGCCATCCT CGACGCGGAC AAGGAGGGCT TTCTGCGTAG CGGCCGGTCG
TTGATCCAGA CGATCGGGCG GGCGGCGCGT AATGTCTCCG GTGAGGTGCA CATGTACGCC
GACAAGATCA CTCCGTCGAT GGCGGAGGCG GTCGGGGAGA CCAATCGGCG TCGGGCCAAA
CAGATCGCCC ACAATGAGGC ACACGGCATC AGCCCGGAGC CGCTGCGTAA GAAGATTCAC
GACATTCTTG ACGACATCTA TCGTGAGGCG GAGGAGACCG AGACCCGCGT CGGTGGAGCG
GTCCGGCAGC TCTCCCGTGG CAAGGCCCCG GTGAAGGAGA CCCGCAGCCG CAGTCGGGCC
GGCGCCGGAC CAGCCCGGGA GGGGATGGCT CGTGCTGAGT TGGCCGAGTT GATCCAGGAG
CTGAATGGCC AGATGCTCGC CGCTGCCCGG GAGTTGCAGT TCGAGTTGGC CGCCCGAATC
CGCGACGAGA TCTCCGAGTT GAAGAAGGAA CTGCGCGGGA TGGACGCCGC GGGGGTGACG
TGA
 
Protein sequence
MALDIPRLDG RFQVVSDFQP AGDQPAAIDD LERRVRRGDR HTVLLGATGT GKSATTAWLI 
ERLQRPTLVL APNKTLCAQL AKEFSELLPN NAVEYFVSYY DYYQPEAYIP QTDTYIEKDS
SINEEVERLR HSATMSLLTR RDVVVVATVS AIYGLGTPEE YLDRAVRVAT GQELDRDQLL
RRLVDIQYTR NDMAFQRGTF RVRGDTLEII PAYEELAVRI ELFGDEVERL YYLNPLTGDV
VREVDHLLIF PATHYAAGPA RMERAIRDIE TELGERLAEL ERRSSLLEAQ RLRMRTTYDI
EMMRQVGFCS GIENYSMHID GRLPGSPPHC LLDYFPDDFL TVVDESHVTI PQVGGMYEGD
ASRKRMLIDH GFRLPSAADN RPLRFDEFLE RVGQMVFLSA TPGSWELEHA QGEYVEQVIR
PTGLVDPEVV VKPTKGQIDD LMHEIKLRTE RDERVLVTTL TKKMAEDLSD YLLENGIRVR
YLHSEVDTLR RVELLRELRK GDYDVLVGIN LLREGLDLPE VSLVAILDAD KEGFLRSGRS
LIQTIGRAAR NVSGEVHMYA DKITPSMAEA VGETNRRRAK QIAHNEAHGI SPEPLRKKIH
DILDDIYREA EETETRVGGA VRQLSRGKAP VKETRSRSRA GAGPAREGMA RAELAELIQE
LNGQMLAAAR ELQFELAARI RDEISELKKE LRGMDAAGVT