Gene Sare_3518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3518 
Symbol 
ID5704646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4059009 
End bp4060766 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content71% 
IMG OID641272945 
Producthypothetical protein 
Protein accessionYP_001538311 
Protein GI159039058 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.663453 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00905238 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCACCGC AGGAGTATGT CCAGGAGTCG TTGGCCGGTA TCGATCCGGC CGTCGGCGGA 
GTTGACCCGG AGCTACCGCT CTACGCGACC ACCTTCGTGG TGGTCGACCT CGAAACCACC
GGCGGTGCGC CGGACGGCGG TGGCATCACC GAGATCGGCG CGGTGAAGGT ACGCGGCGGT
GAGCAGCTCG GTGTGCTCGC CACCCTGGTC AACCCCGGCG TGCCGGTGCC ACCCTTCGTC
ACCGTGCTCA CCGGCATCAC CCAGGCGATG CTGCTGCCTG CCCCGTCGAT CGCCCAGGTG
CTGCCGAGCT TCCTGGAGTT CCTCTCCGCC GACGCGGTCC TCGTCGCCCA CAACGCCCCA
TACGACGTGG GGTTCCTCAA GGCGGCGTGT GCGGCGCACG GCCACCGCTG GCCGAATCCG
CGGATACTGG ACACGGCGGC GCTGGCCCGG CGGGTCCTCA GCCGGGACGA GGTGCCCAAC
CGCCGGCTCG GCACCCTCGC GGCATACTTC CGCACCAGCA CCCAGCCCAC CCATCGGGCA
TTGGACGACG CGAAGGCCAC AGTCGACGTG CTGCACGGGT TGATTGCCCG GCTGGGCGGC
CACCGGGTGC ACACCATCGG CGAGGCGATC GAGTTCGCCC GGGCGGTCAC CCCCACGCAG
CGTCGAAAGC GGCACCTCGC CGAGGGGCTA CCCCGTACGC CCGGCGTCTA CCTCTTCCGG
GGCGCCGACG ACCGCCCGCT CTACGTCGGC ACCTCGGTTG ACATCGCCAC CCGGGTCCGC
AGCTACTTCA CCGCCGCCGA GAAGCGGGCC CGCATCTCGG AGATGCTCGC CGCCGCCGTG
CGGGTGGAGG CGGTGGAGTG CGCTCACTCG CTCGAGGCAG AGGTGCGGGA GCTGCGACTG
ATCGGGGCGC ACGCCCCGCC GTACAACCGG CGGTCCAAGT TCCCCGAGCG GGTGGTCTGG
CTGAAGCTGA CCGACGAGGC GTACCCACGG TTGTCGGTGG TTCGCAAGCT GACCCCGGGT
GACGAGGCAT ACCTCGGACC CTTCACCTCC CGCCGTGCCG CGGAGCTCGC CGCCGCCGGC
TTCCACGACG CCATGCCGCT GCGGCAGTGC ACGCACCGAC TGTCGGTGCG GACGGTCACG
CCGGCCTGCG CGTTGGCGGA GTTGGGTCGC TGCCCGGCGC CCTGCGAGCA TCGGATCACC
CCCGACGAGT ACGCGAACCG AGCGGTGACG CCCTTCCGCA CGGCGTGCCG CGGCGACCCG
CAGGGCGTGG TGGATGCGTT GCTTGGCCGG ATCGAGACGC TCTCGACGGG ACAACGCTAC
GAGGAGGCCG CGGTGGTGCG GTCCCGGCTC ACCGCCGTGC TCCGCGCCAC GACCCGGATG
CAGCGCCTCG CCGCGCTCAC CGGGATCGGC GAGGTGGCGG CTGCCCGGCC GGCGGTGGGC
GGAGGCTGGG AGCTGGCGCT GGTGCGGTAT GGACGGCTCG CCGGTGCCGG TGTGTCACCG
CCGGGTGTCC ACCCGCGACC GACGATCGCC ACGATTCGGG CTACTGCGGA GACAGTGACC
CCGGGGTTGG GACCGACTCC AGCTGCCAGC GCCGAGGAGA CCGAACGCAT CTTGTCCTGG
TTGGAACGTC CGGAGACGAG ACTGGTGGAG ATGTCCTCCG GCTGGGCCTC CCCGGCGACT
GGGGCGGGCC GGTTCCAGAC TCTGCTGATG AAGGCGACGA ACGCCGCTTC CCACCAACTA
TCATCCGAAA GCCCATGA
 
Protein sequence
MAPQEYVQES LAGIDPAVGG VDPELPLYAT TFVVVDLETT GGAPDGGGIT EIGAVKVRGG 
EQLGVLATLV NPGVPVPPFV TVLTGITQAM LLPAPSIAQV LPSFLEFLSA DAVLVAHNAP
YDVGFLKAAC AAHGHRWPNP RILDTAALAR RVLSRDEVPN RRLGTLAAYF RTSTQPTHRA
LDDAKATVDV LHGLIARLGG HRVHTIGEAI EFARAVTPTQ RRKRHLAEGL PRTPGVYLFR
GADDRPLYVG TSVDIATRVR SYFTAAEKRA RISEMLAAAV RVEAVECAHS LEAEVRELRL
IGAHAPPYNR RSKFPERVVW LKLTDEAYPR LSVVRKLTPG DEAYLGPFTS RRAAELAAAG
FHDAMPLRQC THRLSVRTVT PACALAELGR CPAPCEHRIT PDEYANRAVT PFRTACRGDP
QGVVDALLGR IETLSTGQRY EEAAVVRSRL TAVLRATTRM QRLAALTGIG EVAAARPAVG
GGWELALVRY GRLAGAGVSP PGVHPRPTIA TIRATAETVT PGLGPTPAAS AEETERILSW
LERPETRLVE MSSGWASPAT GAGRFQTLLM KATNAASHQL SSESP